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SOME PROBABILITY ESTIMATES FROM CONTINGENCY TABLES 


C. D. Smith 


The problem of comparing joint outcomes from two samples with equal 
number N arises when it is desirable to arrange the data in mutually 
exclusive classes. In such cases the order of joint outcomes may be 
arranged in a contingency table. Karl Pearson calculated a coefficient 
of contingency and also applied the chi-square test. Wilks and others 
have given probability distributions for r-successes in samples of 
N, r < N, where a success is a mated pair of outcomes. For example, two 
samples of corresponding items are each classes as good (G), or bad (B). 
A success is indicated by the joint outcomes GG or BB, The order of all 
joint outcomes may be recorded in the contingency table 


G B 


GG GB 
BG BB 














Assuming that G has probability p and B has probability q the binomial 


(p + q) p* + 2pq + q* gives the order of the cell numbers where 2pq 
is composed of pq and qp. If p = q the cell numbers in the table are 
equal and the table may be represented with the number x in each cell. 
For p * q the cell numbers are not equal and it will be convenient to use 
the general notation as fol lows: 














The subscripts i and j indicate the respective rows and columes. Represent 
the sums of rows and columes by R; and C; respectively. The number of 
successes is given by L = x,, + X55. The weighted average number of 
all joint outcomes is given by A= (C,R, + C,R,)/N, where the number 
of a given outcome from the first sample is weighted with the number 
of the corresponding outcome from the second sample. 

We define a measure of relative discrepancy by 


(1) f = (L - A)/N. 


The characteristics of the measure of discrepancy are given by the 
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following results. For x, ,/x2, = x;2/%j, it follows that x,, * C,R,/N, 
X%>_ = C,R,/N, and L = A which gives f = 0. When the cell numbers are 
not proportional, L 7 A and f is either positive or negative. In this 
case for x,,/x2; 2 *12/%2>, L 2 A and f 2 0. The results mean that f 
is a measure of the failure of the table to give proportional numbers 
of joint outcomes. One extreme case occurs when all joint outcomes 
give the equal numbers GG = BB, In this case f = %. In the other 
extreme all outcomes give equal numbers BG = GB, In this caze f = -%. 
When f is calculated for all possible outcomes in samples of eight 
the distribution of f with interval .1 is approximately normal with 
mean zero and 9 = .2. Under these conditions we may assume that the 
distribution of f is approximately normal with mean zero and 9 = .2, 
for N not less than eight. 

In applying the method, the joint outcomes in two samples are arranged 
in a contingency table, the value of f is calculated, and the corre- 
sponding probability taken from the normal table of areas. A problem 
taken from the field of industrial management illustrates the practical 
value of the method. On the basis of aptitudes tests a group of 100 
applicants for positions as filing clerks are rated as good or bad. 
After due trial on the job each individual is classed by the record 
as good or bad. The question for management is the extent to which 
the tests may indicate expected success on the job. For a given case 
we have the table 


G B 


G] 45 20 | 65 
B} 10 25 | 35 


55 45 


L = 70, A= 51.5, and f = (L - A)/N= .185. From the table of areas 
at x/7 = .92 the probability is P = .3212. Since the probabality for 
improving the test estimates is given by increasing f one would expect 
better results to be possible in about 18% of all cases. In some cases 
this result may be considered good enough while in others it may be 
desirable to seek a better method of rating applicants. 

Other applications arise in quality inspection and control. The 














output from each of a group of workers or from machines may be classified 
both before and after a program for improvement has been carried out. 
Two inspections may be compared to determine the extent to which machines 
are losing efficiency through wear. In physics a method of prediction 
may be compared with later observed results to determine the value of 
prediction in such cases as, breaking strength, crushing weight, 
occurrence of daily rainfall, or daily maximum temperatures. These 
problems are mentioned to indicate the wide variety of important problems 
which may be studied by use of the contingency table and the value of 
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the relative discrepancy f, where an estimate of the probability of 
better correspondence is desired. 

The case for three mutually exclusive events has some interesting 
variations from that of two events. The classes may be represented by 
good (G), acceptable (A), and bad (B). With probabilities p,, pz, P3, 
of respective outcomes G, A, B, in each sample, the order of joint 
outcomes is given by the expansion of (p; + pz + p3)° = PiP; * P2P2 
* P3Ps * PyP2 * PoP; * PyP3 * PsP) * PoP3 * P3P2- The table of joint 
outcomes is as follows: 





G A B 
G | GG GA GB 
A | AG AA AB 
B | BG BA BB 











For cell numbers of joint outcomes Xo, we have for sums of rows R., 
for sums of columes C., and the sample number is N. A model experiment 
for the case of equal probabilities may be arranged as follows. Use 
two triangular regular prisms with one letter on each face. Place pegs 
in each end so that each, when thrown, will come to rest on one face. 
Throw the two prisms and take the letters on which they lie as a joint 
event. Repeat the experiment N times and record results in the con- 
tengency table. 


In the table of cell numbers x;. it turns out that x,, = C,R,/N, and 
similar results for X50 and X35 provided that Z,,/%o, = X1o/%o0 = 
X,3/%o5 = %_,/%3, = %0/Xa- 5 %53/%33° The number of like outcomes, or 


mated pairs, is L = x,, + x,, + x3, and the weighted average of all 
joint outcomes is A = 2C:R;/N. Here as before the sum of each row is 
weighted with the sum of the corresponding colume. For proportional 
cell numbers as given above it is true that L = A, otherwise L 7 A. 
One extreme occurs when all joint outcomes are mated pairs such that 
X11 * Xo * %33-Here f = (L - A)/N = 2/3. The other extreme occurs 


when all joint outcomes fall in the cells x In this case 


a “ees 
f = -1/2. The case of proportional cell numbers has been stated for 
which f = 0. It is now evident that for three mutually exclusive cate- 
gories, the values of f are distributed about f = 0 with range -1/2 to 
2/3. Obviously the distribution of values of f is not normal and esti- 
mates of the probability must be taken from a suitable frequency curve. 
In practical applications the use of Pearson’s Type III frequency 
curve has given satisfactory estimates for the probability of a given 
value of f. 

To illustrate a case of three mtually exclusive events, the following 
table represents 200 placement test scores paired with the record 
of the same 200 freshmen in algebra. 
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Test Score 
G A B 


G 25 4 0 29 
Algebra 4 | 5 80 13] 98 
Score 

B 0 ll 62 73 


30 95 75 200 














From the table, f = (L - A)/N = .44. In Pearson’s Type III with the 
Origin at the mode, the probability of a deviation greater than .44 
on the range extending to .67 is fairly small, and the placement test 
may be considered a good estimate of expected performance in algebra. 

In general the table for n-classes gives the positive extreme value 
of f = (n-1)/n when all pairs are mated such that x,, = %)9 = +++ * X,,) 
in which L = N. The extreme negative value f = -1/2 occurs when all 
pairs are mismated such that Xin *al? in which case L = 0. Evidently 
the degree of positive skewness increases as the number of classes 
increases. For n> 3, a suitable curve from the Gram-Charlier system 
should give good estimates of the probability that values of f may 
be greater than a given value. 


University of Alabama 
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COLLEGIATE ARTICLES 


Graduate training not required for reading 


THE TRANSFER DISTRIBUTION 
John Freund 


An important type of probability sequence is the sequence where the 
probability of each element (trial) is determined exclusively by the 
result of the immediately preceding trial. These sequences, which belong 
to the class which is commonly referred to as simple stationary Markoff 
chains, are called sequences with probability transfer. It is our purpose 
to develop in this paper the transfer distribution, i.e. the probability 
of obtaining ‘‘x successes in n trials,” if the probability of a success 
depends exclusively on the outcome of the immediately preceding trial. 

Let us denote a “success” by the attribute B and a “failure” by 
the attribute B (non-B). We shall consider as known, the following 
four probabilities which are defined as: 


i) the probability that a B will be succeeded by a B, or P(B,B') = 

Pry “ - 

ii) the probability that a B will be succeeded by a B, or P(B,B’) = 
Pia: ad 

iii) the probability that a B will be succeeded by a B, or P(B,B’) = 
Poy: * ee iin hse 

iv) the probability that a B will be succeeded by a B, or P(B,B’) = 
P22" 

It can easily be shown that the probability P(B), i.e., the probability 

of obtaining a B, regardless of the predecessor, is 
Poy 


P(B) = i (1) 
= Pig > Pay 





which follows directly from the equation: 
P(B) = P(B)P(B,B') + [1 - P(B))P(B,B’) (2) 


In order to simplify our notation, we shall write P(B) simply as p. 
The number of ways (combinations) in which we can arrange x B’s and 
5 


1 n . . . . . 
(n - x) B’s is |" In contrast to the Binomial distribution, where 


the probability of obtaining any one of these arrnngements is the same, 
we find that for transfer sequences, the probabilities which are assoc- 


Ithis terminology is due to H. Reichenbach, “The theory of probarility,” 
University of California Press, 1949. 
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iated with the various combinations depend on two factors. First of 
all, the probability will depend on whether the first element is a B or 
a B and secondly it will depend on the number of runs” which appear in 
the arrangement. In order to find the probability of obtaining x B’s in 


n trials, we shall use the following well known theorems from the field 
of combinatorial statistics: 


A) If we have x elements of one kind and (nm - x) of another, then 
the number of ways in which they can be arranged to form 2k runs 


1s 
z-l({n-zx- I 
te | 3) 
k- 1 k- 1 


The number of ways in which they can be arranged to form 2k + 1 
runs if the first element is a B, is 


welgers ss 


and if the first element is a B, it is 


x- l{n-x- 1 
7 bei ) 
k- 1 k 


In the above expressions the variable k goes from 1 to x if x <n - x, 
In order to simplify our expressions, we shall therefore assume that 
x <n- x, which does not offer any undue restrictions, because w can 
always calculate the probability of obtaining (n - x) B’s if n- x <x 
by interchanging the Pi; in our final results. 


If we have an even number of runs (namely 2k) and if the first element 
is a B, the following must be true in regard to the combination of the 
attributes: i 

i) there are k B’s which are preceded by B’s _ 
ii) there are k - 1 B’s which are preceded by B’s © 
iii) there are n - x - k B’s which are preceded by B’s 
iv) there are x - k B’s which are preceded by B’s. 
The probability of obtaining x B’s and n- x B’s which form 2k runs and 
where the first element is a B, is therefore 


i Ny(n- = - VT) gp pnt aes-k sok 
Py . ae p-y jeu2P ar P22 Pas P 


2A run is a sequence of letters of the same kind which is bounded by 
letters of the other kind, or which is at the beginning or the end of 
the total sequence. 

3See S. S. Wilks, “Mathematical Statistics,’ Princeton Univ. Press 1944, 
p. 203. 
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a 
in which any binomial coefficient f = 0, if a< b. If the first element 


is a B, the corresponding probability is 


_a* Bie 2 8 -Ik ene . 
<7 f . J k- 1 ler Pai1P22 (1 - p) (7) 


If we have an odd number of runs (2k + 1) and the first element is 
a B, the following mst be true for the sequence of attributes: 
i) there are k B’s which are preceded by B’s 
ii) there are k B’s which are preceded by B’s in 
iii) there are n - x - k B’s which are preceded by B’s 
iv) there are x - k - 1 B’s which are preceded by B’s 
The probability of obtaining x B’s and (n - x) B’s, which form 2k + 1 
runs and where the first element is a B, is therefore 


i qi “e's n-x-k_x-k-1 
Ps ‘a - <2 Pi 2P21Po2 ms «CF 


The corresponding probability, if the first term is a B, is 


Pp. = f ? if" ~ 2 1) pk nes-k-1 ak 


4 JP1 2P21P22 Py, (1 - p) (9) 


k-1)l sk 


The total probability of obtaining x B’s inn trials (if x <n- x and 
if x does not equal 0) is the sum of these four probabilities, summed 
on k, i.e. 


f(x) 


(x ~ Yim xe pt naman k mk 
lk - | —_— Pr2P21 Poo Py, P * 


me” 2° Beas ties Pi _ 
i guy JPr2 Pore (l- p)* 


2a * 2% pos~k_s-k-1. . 
k a Pi oPo1P eo Pry P 


“ 
x- T\(n- x - "| k hk n-x-k-1 x- -kiy wa 
As a k JPr2ParPe2 Pry P 


Substituting (1) into expression (11), we finally obtain 
g P 
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1 o* Dfe- 2-5 
“ > k _k _W-x-k_ xk 
site 1 - p,, * Pp, **! ae « ‘|| oe |t apt Pe Pyy 





Re mies? acd k _k+1 n-x-k x-k-1 
| \ Jet ars Poo Pay (12) 


k k- 1 


22 7 Tim eT ket kk nee k-t xmk 
oe k Pie ParPo2 Pry 


If x equals zero, we can obtain directly 


n-1 
P12P22 
- Pry * Pay 


f(0) = (13) 





It can easily be shown that in the special case where p,,; = Po; 


expression (12) will reduce to the well known formula for the binomial 
distribution. 

If n is large the Transfer Distribut*on approaches a normal dis- 
tribution which has the mean 


a= aP2t (14) 


1 - Py, * Poy 





and whose standard deviation is approximately equal to 


; oS Ca - p) hh , 2(py, - Pd 
” en es 








(15) 


where p stands for the probability P(B) which is given by expression (1). 
Formula (15) is not an exact formula, but an approximation which was 
derived by V. Bargmann*. If p,, = pz, = p then (14) and (15) reduce to 


the corresponding moments of the ordinary binomial distribution. 


4See H. Reichenbach, “The Theory of Probability,” p. 290. 


Alfred University 





VECTOR ANALYSIS 


Homer V. Craig 


1. The nature and history of vector analysis. For the purpose of 
orientation, vector analysis may be regarded quite advantageously 
(however somewhat inaccurately) as a generalization of the algebra and 
calculus of real numbers resulting from the replacement of the real 
number as the basic entity by the concept directed stroke or vector 
regarded as embodying the couplet of ideas magnitude and direction, 
Such a brief characterization is not completely satisfactory because 
it is an oversimplification from both the historical and the conceptual 
viewpoints. Historically, the situation is somewhat complex. Vector 
analysis as a mathematics of directed strokes was extracted from two 
more extensive theories - the Ausdehnungslehre of Grassmann (1840-44) and 
Hamilton’s Quaternion Analysis (1843-1844) - two works of first rank 
historical importance. Conceptually, the term vector is applied to 
more abstract entities than directed line segments in ordinary three- 
dimensional space. The spaces involved may be N-dimensional with N 
an unspecified integer, or they may have infinitely many dimensions. 
The basic patterns for these higher vector theories, however, derive 
quite naturally from the same parent theory, the elementary vector 
analysis based on the directed stroke. Thus we have on the one hand an 
elementary theory extracted from more comprehensive and profound theories 
but rooted in synthetic three-dimensional geometry, while on the other 
we have a contrasting theory (or set of theories) based on natural 
generalizations of the dominant algebraic structures of the elementary 
theory. To sum up, the vector analysis of directed strokes was an off- 
shoot of the theory of quaternions and the remarkably inclusive, 
multi-dimensional Ausdehnungslehre™— two revolutionary works which mark 
the beginning of a new epoch in algebra (both presented among other 
things the startling innovation of products of quantities such that 
ab # ba). 

Mathematics grows by virtue of certain processes which in degree 
at least are peculiar to it and the exact sciences. Among these, and 
of special interest to us at the moment because it is well exemplified 
by vector analysis, is the process of creative extension or modification 
of key entities, sometimes called generalization because the resulting 
theory may embrace the original as a special case. Such generalizations 
although they may seem simple in retrospect are evidently not easy to 
make, but once accomplished growth may be accelerated to an astounding 
pace. Some creative modifications take the form of questioning the 
urgency, efficacy, or validity of sanctioned viewpoints and assumptions. 
For example, one of the major developments of vector algebra stems from 
a denial of the urgency of the commutative law of multiplication, 
E. T. Bell, commenting on Hamilton’s quaternion theory and its non- 
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commutative multiplication, states: ‘‘It is radical departures from 
traditional orthodoxy such as these that carry mathematics forward 
what seems like a century or more at one stride.” 

As an instance of creative generalization, let us consider the 
step from arithmetic to algebra. In elementary arithmetic, we are given 
a set of abstract objects, the numbers, and two basic processes addition 
and multiplication. These processes are such that from each given pair 
of numbers there is determined a unique number called their sum and a 
second unique number called their product. In our work with these 
processes and their inverses (subtraction, division, and the extraction 
of roots), we are concerned almost exclusively with specific numbers. 
By simply replacing the specific numbers with letters denoting un- 
specified numbers we open the door to algebra, a subject which is at 
once vastly more extensive and vastly more powerful than the parent 
elementary arithmetic from which it arose. So long as the fundamental 
entity denoted by the letters is taken to be number (a single real 
number rather than a set of real numbers or something more abstract), 
this generalization is essentially of a purely logical nature. 

2. Vectors and their sums. Vector analysis, however, furnishes a 
generalization of an entirely different type. This generalization is 
obtained by replacing the entity number with a more complicated concept - 
the directed stroke, which involves the couplet of ideas number and 
direction. The number associated with the directed stroke is its length 
in terms of a given unit and the correlated direction is, of course, 
the direction of the stroke. Thus the overall program of elementary 
vector analysis is to create a mathematics of directed strokes in 
contradistinction to the mathematics of real numbers. The motivation 
for much of the development is supplied by elementary mechanics — a 
subject abounding in quantities involving direction such as: displace- 
ment, force, velocity, acceleration, etc. 

The earlier writers on quaternions, and Gibbs (Elements of Vector 
Analysis 1881, 1884; Velocity of Propagation of Electrostatic Force 
1896) have used Greek letters for vectors and reserved Latin letters for 
scalars, while other writers have done just the reverse. Still other 
schemes are to denote vectors by capitals or by letters with superposed 
bars. We shall follow the practice of Gibbs as set forth in the following 
quotation “As it is convenient that the form of the letter should 
indicate whether a vector or a scalar is denoted, we shall use the small 
Greek letters to denote vectors, and the small English letters to denote 
scalars. (The three letters i, j, k will make an exception...).” 

Vector Addition. Our first problem is the formulation of the 
definition of the sum of two vectors or directed strokes. What meaning 
shall we ascribe to the phrase the sum of two vectors? Before attempting 
to answer this question, let us examine some of the physical counter- 
parts of vectors. The first and perhaps simplest is the idea of straight 
line displacement - a particle or object is displaced in a straight 
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line from the point Ato the point B, We shall correlate this displace- 
ment to the symbol AB, which represents the directed stroke beginning 
at A and ending at B, We regard the essential part of the displacement 
as the fact that the particle was at A and now is at B. If we follow 
this given displacement with a second displacement say the displacement 
from B to C, then the single displacement which is equivalent to the 
two separate displacements AB and BC, in the sense that it takes the 
particle from A to C, is, of _course, the displacement AC, This suggests 
DEFINITION (2.1) AB + BC = AC 


B 
ox 
C 
Ki & g 
Ly B aC 
‘ Ke 


Fig. (2.1) 


With regard to this definition, the reader should bear in mind constantly 
two facts: (a) Definitions are logically arbitrary and are not subject 
to proof or disproof; (b) We are not defining the sum of the lengths 
of two strokes. The length of a stroke is a real number, and thus the 
sum of the lengths of two strokes is merely the sum of two real numbers 
and belongs to the realm of the mathematics of the real number system. 
We observe that since the straight line furnishes the shortest path 
between two points, the sum of the mere lengths of the vectors AB and 
BC is greater than or equal to the length of the vector sum AC, The 
equality holds only in case the three points A, B, C, lie on the same 
line. 

A second instance of a vector quantity is furnished by the physical 
concept force. If a force is applied to a particle the force cannot 
be described completely by specifying its magnitude alone. It develops 
however, that in this case the force is adequately described by giving 
in addition to its magnitude, its direction of application, i.e., by 
specifying its vector. Furthermore, it is found that the single force 
which is equivalent to two given forces @ and § acting on a particle 
is the force corresponding to the vector sum of a and £4 as given in the 
preceding definition. This means that if a and & are two forces acting 
simultaneously on a particle, then - (a + 4) is the single force which 
will wipe out the combined effect of the other two. In computing a + §4, 
we move £ by parallel displacement until its origin is at the vertex 
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or terminus of a and then draw the vector a + & from the origin of 
a to the new terminus of £, 

We now define two unrestricted vectors (i.e. unrestricted as to 
locality) to be equal if they have the same magnitude and the same 
direction. Thus, unless we have previously agreed to keep a given vector 
localized, we are free to move it as we please so long as we do not 
change its magnitude or direction. The equality of vectors (the symbol 
is =) as here defined is evidently less stringent than identity. The 
fact that our definition of the sum of two vectors is applicable to both 
displacements and forces augurs well for a wide applicability. 

We now turn to a second quantity that can be described by giving 
its magnitude and direction; namely, velocity. If a particle is moving 
from south to north, for example, with a speed of say ten centimeters 
per second, this information can be specified by means of a vector ten 
units in length extending from south to north. Thus velocities may be 
adequately described by means of directed strokes or vectors. Further- 
more, the assumption that velocitie: add vectorially is apparently in 
agreement with physical reality. To examine a concrete instance, let us 
suppose that a ball rolls westwardly along an east-west line drawn 
on a table with a speed of four centimeters per second relative to the 
table. Further, let us suppose that while this is happening the table 
is moving northward three centimeters per second relative to the floor. 
Relative to the floor, the ball is displaced four centimeters west 
and three centimeters north of its original position in one second. 
This displacement is in exact agreement with that determined by adding 
the two velocity vectors according to the rule given. 

The vector addition of velocities has an important application 
to the navigation of aircraft. The fundamental formula is: 

The velocity of the plane relative to the ground is the vector sum 
of the velocity of the plane relative to the air and the velocity of the 
air relative to the ground. 

Algebraic properties of the sum and difference of two vectors. 
We have defined the sum of two vectors a and £ to be the vector obtained 
by placing the beginning or origin of & at the terminus of a and drawing 
the stroke from the origin of a to the new terminus of &. In moving 
8 it is understood that neither the magnitude nor the direction of & 
is altered. We denote the sum of two vectors a and § by the symbola + £, 
The symbol + used in this way does not have the same meaning as the 
+ in 2 + 3 since the entities involved a, § and 2, 3, are entirely 
different in the two cases. Furthermore, as we have already pointed out, 
the length of a + & is usually less than a + b, the sum of the lengths 
of a and &. It is imperative that we keep in mind the distinction between 
a + 6 and a+ b. Simple geometric arguments will show that (a) a + 6 = 
8 +a, the commutative law of addition, and (b) (a + 8) + y=a+t+ (h+ 
y), the associative law of addition. This last relationship becomes 
fairly obvious if we select a point A for the origin of a, let £ be the 
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unique point such that AB = a, C and D the unique points such that 
BC = B and CD = y. The relationship (b), then assumes the form (AB. + 


BC) + CD = AB + (BC + CD). The effect of the addition in AB * BC, 
for example, is to leave out the intermediate point B and go directly 
from A to C, Thus both members of the relationship in question by 
leaving out the repeated letters reduce to AD. Evidently, this law 
can be extended to the addition of more than three vectors and should 
be retained in the postulational construction of a vector space of 
higher dimensions. To recapitulate, in the addition of vectors the 
order and grouping may be selected as we please. 

The difference of two vectors & - a is defined to be the vector ¥ 
which satisfies the equation a + y = 4, thus & - a is the vector 
having the property that if it is added to a the result will be 4, or 
in symbols, a + (8 - a) = &. This vector § - a may be obtained by drawing 
a and £ from a common origin and then drawing from the terminus of a 
to the terminus of 6 (see figure (2.2). It may be helpful in a specific 
case to observe the bearing of the vectors. Thus in the figure 4 extends 





Fig. (2.2) 


upward more than a while & has the greater rightward component, thus 
8 - a bears upward and to the left. Also, we should observe that our 
figure would collapse if & were equal to a. The symbol a - a is also 
denoted by 0, and we refer to it as the zero vector and regard it as 
representing a degenerate vector of zero length and indeterminate 
direction. 

3. Multiplication of a vector by a scalar. If x is any positive 
real number and a any vector then the symbol xa is by definition taken 
to represent the vector having the direction of a but having magnitude 
of a, Thus 2a is a vector having twice the magnitude of a but the same 
direction. The symbolic products 0a and (-1l)a are defined to be the 
zero vector and the vector having the magnitude of a but oppositely 
directed, respectively. We also make the definition -a = (-l)a. We 
may now define multiplication of a vector by a negative number ~x by 
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means of the relationship (-x)a = -(xa) thus -x times a is merely the 
reversal of x times a. To recapitulate, multiplication of a vector by 
a positive number other than unity stretches or contracts it, while 
multiplication by a negative number has the added effect of bringing 
about a reversal. 

Incidentally, the definition of multiplication of a vector by a 
negative number gives an interesting approach to the law of signs of 
elementary algebra. Thus (-1)[(-1)a) is the reversal of the reversal 
of a which is simply a itself. Now since we desire to have (-1)[(-l)a] = 
[(-1)(-1)]a, we should define (-1)(-1) to be +1, and similarly (-x)(-y) 
should be xy. 

4. Cartesian coordinates. A coordinate system in a plane is 
essentially a scheme for attaching ordered number pairs to points. A 
point 0 in a plane together with a set of two unit vectors i and j 
lying in the plane and mtually perpendicular constitute a basis for 
the familiar rectangular Cartesian coordinate system of elementary 
analytic geometry. The correlation between the number pairs (a,6) and 
the points is achieved by the agreement that the number pair (a,b) is 
to be associated with the terminal point P of the vector ai + bj when 
this vector is drawn from the origin 0. The numbers a and 6 are called 
the coordinates of P and also the components of the vector ai + bj. 
Thus to locate the point corresponding to the number pair (2,3) we 
start at the origin and generate the vector 21 by moving two units in the 
i-direction, then from this new position we turn through a right angle 
and move three units in the j-direction. This last displacement adds 
3j7 to 2i and hence our terminal point is the terminus of the vector 
2i + 3j when this vector is drawn from the origin. If we agree to let 
P be a radius vector, i.e., a vector whose origin is anchored at the 


(2,3) 











origin of coordinates, o, then we may say that the point corresponding 
to (2,3) is the endpoint or terminus of the vector ( given by P = 2i + 
3j. Thus the first number of the number pair is the multiplier of the 
first vector of the set i, j. Similarly, the point corresponding to 
-2, 3 is the terminus of ( for P = -2i + 3j. Multiplication of a vector 
by a negative number reverses it, hence -2i points to the left if t 
points to the right. Therefore, (-2,3) or more properly the point it 
represents, lies to the left of the origin. 
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In addition to the assumption that we have given the plane, the point 
0, and the unit vectors i and j we assume that: (a) given any two dis- 
tinct points in the plane A, B there exists a unique vector AB joining 
A and B; (b) any vector in the plane admits of a unique representation 
of the form ai + bj, and (c) any point A and any vector a determine 
a unique point B such that AB = a. These assumptions are sufficient to 
ensure that each number pair (a,b) determines a unique point and con- 
versely each point has a unique number pair correlated to it. 

It is customary to call the line through 0 containing the vector i 
the x-axis, and the line through 0 containing j, the y-axis. The 
coordinates of an unspecified point are frequently denoted by x and y. 
The important fact for us at present is that if the coordinates of P 
are designated by x and y, then the radius vector which goes from 0 to 
P is given by 9 = xt + yj. 

Cartesian coordinates in space. If to the pair of vectors i and j 
already introduced we add a third unit vector k perpendicular to the 
plane of i and j, then we have the basis for a space coordinate system 
which is a natural generalization of the plane coordinate system just 
considered. Specifically, the point corresponding to the number triplet 
x, y, z is the vertex of terminus of the radius vector ( given by 
p = xi + yj + zk. Thus to locate the point correlated to 2, 3, 4, we 
first generate the vector 2i by moving two units in the direction in 
which i points, then we turn through a right angle and generate 3j, and 
finally generate 4k by moving 4 units in the k-direction (which is 
perpendicular to the plane of i and j). The vertex of 4k in the present 
position of 4k is then the desired point. If i points from left to 
right, while j points away from us, and k points upward, then the vector 
p, p= 2i + 3j + 4k is generated by starting at the origin moving two 
units to the right, then three units away from us (j-direction), and 
finally four units straight upwards. If instead of positive numbers we 
had considered one or more negative numbers, the procedure would have 
been the same with the proviso, of course, that multiplication of a 
vector by a negative number calls for a reversal in direction. 

A change of coordinate systems. The transformation equations connect- 
ing two rectangular Cartesian coordinate systems with a common origin 
is of fundamental importance for a variety of topics in mathematics and 
mathematical physics. As an illustration of the algebraic use of geo- 
metric entities, we shall derive the transformation equation connecting 
the base vectors i, j and i, J» of two given Cartesian coordinate 
systems in the plane. It is convenient to introduce the rotational 
operator i which turns vectors through a right angle in the positive 
(counter-clockwise) sense. We first recall our assumption that any 
vector in a plane may be represented in terms of a set of base vectors 
i, j. Hence, numbers | and m exist such that i= li + mj. We now assume 
that the right angles between the coordinate vectors t, j and t, j, 
are both positive so that it = j while ii = j and ij = -i. The rotational 
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operator i, has the effect of rotating the vector to which it is applied 
through an angle of 90° in the positive sense which we have taken to 
be the sense of the shortest rotation of i into j and hence of j into 
-t. From the natural assumption that the rotation of a triangle through 
90° effects a rotation of all three sides through 90°, it follows that 
the operator i is distributive. Hence if we multiply both members of 
the equation i = li + mj by the rotational operator i, we get j = lii 
+ mij = lj - mi since iit = j, it = j, ij = -i. Thus the equations 
connecting the new base vectors i, j with the original base vectors 
t, j are 


(4.1) t= li + mj, jtcm+ Lj 

















Fig. (4.2) 


The square array of coefficients of i and j in the right members 
of (4.1), when the second equation is written directly beneath the 


l 
first, namely, | "| is called the matrix of the transformation. This 
© -m 


particular matrix has some remarkable properties which persist through 
the generalization of (4.1) to three and higher dimensional cases. 
Transformations of this type are of sufficient importance in mathematics 
and mathematical physics to warrant a listing of these properties. First, 
since i is of unit length and equal to li + aj, it follows from thes 
theorem of Pythagoras that 1? + m? = 1 and this means that the sum of 
the squares of the elements in any row or column is one. Also, the sum 
of the products of corresponding elements in two different rows or in 
two different columns is zero. To illustrate, using the two rows 
(horizontal) 1, m and -m, l, we have l(-m) + ml = 0. It will be very 
helpful for an understanding of later developments if we note carefully 
that this expression |(-m) + ml is the sum of the products of corres- 
ponding components of the vectors i and j since i = li + mj and j * 
-mi + lj. Furthermore, if we multiply the vector i by an arbitrary 
number a and j by some number 6, the resulting vectors would still be 
at right angles and the sum of the products of corresponding components 
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would still be zero, thus (al)(-bm) + (am)(bl) = 0. If we abbreviate 
by denoting the components of ai by a,a, and similarly the components of 
bj by b,, b,, then we have a,b, + a,b, = 0. Evidently, the expression 
a,b, + a,6, is of some importance since its vanishing is apparently 
connected with the fact of orthogonality. Later on, we shall see that 
this certainly is the case. 

In order to illustrate that the algebraic procedures involved in the 
manipulation of vector equations are no more difficult than those 
occurring in elementary algebra, let us attempt to solve the equations 
(4.1) for the vectors i and j in terms of the vectors i, and j. Examina- 
tion shows that this can be done by the very same procedures used in 
the corresponding problem in elementary algebra. Thus multiplying the 
first equation by l and the second by -m and adding, we get successively 
li = l7i + Inj, “mj = mi - mlj, Li -mj =i, since l* + m* = 1, Similar- 
ly, we have j = mi + lj and, therefore, the complete set of equations 
connecting lt, j and t, j is 


(4.2) t=li+m, jr-mi+ljy; i=li-m, jzmit lj. 


However, in one respect at least the vector equations are quite 
different from their real number counterparts. As an illustration, 
we shall find the transformation of coordinates induced by the trans- 
formation in the base vectors (4.2). Thus, let P be any point and let 
x,y and x,y denote the coordinates of P relative to the base sets t,j 
and i,j. This means that if 9 is the radius vector OP from the origin 
to the point in question then, on the one hand, 9 = xi + yj, while for 
essentially the same reason (the meaning of Cartesian coordinates) 
O= xi + yj. That is 


(4.3) xi + yj-= xi + yj. 


Substitution for i and Jy from the last two equations of the set (4.2) 
yields the vector equation. 


(4.4) xi + yj = xli - xmj + ymi + ylj. 


We have now come to an instance of an important principle of vector 
algebra: A vector equation in N-dimensional space is equivalent to 
N scalar (or real number) equations. In the case of (4.4), N is two; 
and the essential fact is that any vector in the plane at hand has a 
unique set of components relative to i and j. This means that the 
multiplier of t in the left member of (4.4) must equal the multiplier 
of t on the right. A similar statement holds of course with regard 
to j. Hence from the single vector equation (4.4) there emerges the 
patr of scalar or real number equations, 


(4.5) x = lx + my; y = -mx + ly 
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which constitute the expression of the new coordinates x and y in terms 
of the original coordinates x, y. Comparison of equations (4.5) with 
(4.1) reveals a remarkable fact; namely, the transformation of the 
base vectors (4.1) induces essentially the same (the technical term 
is cogredient) transformation in the coordinate variables, i.e. if 
in (4.1) we replace the vector symbols i, j, i, j, with the symbols 
for the coordinate variables x, y, x, y, then (4.1) becomes (4.5). 
Evidently, the solution of (4.5) for x and y is formally the same as 
the solution of (4.1) and hence we may read it out of the set of equa- 
tions (4.1). Thus the Complete set of coordinate transformation equa- 
tions is 


(4.6) x=lx+my, y=-mxet+ly; x=lx-my, y= mxt ly. 


Furthermore, since the components of a vector are merely the coordinates 
of its end point, when the vector is drawn from the origin, the con- 
ponents of a vector follow the transformation (4.6). To illustrate, 
if ais a vector in the plane at hand and a,, a, are its components 
relative to the base set t,j while a,, a, are its components relative 


to i,j, so that a= a,i + a,j =a L 


(4.7) a, = la, + maz, a@, = -ma, + la,. 


Similarly, if 4 is a second vector and b., b,; b, ,b. are its components 


relative to t, j and t, j, respectively, then 


(4.8) b, = lb, + mb, 6, = -mb, + Lb,. 

If we recall that the orthogonality of the vectors a and / implied 
the vanishing of the expression a,b, + a,b, and take account of the 
fact that a,, a,; 6,, 6b, are the components of a and / relative to 
any set of base vectors in the plane at hand, we see that whenever 
a,b, + a,b, vanishes, a,b, + a,b, must do so likewise. This remarkable 
fact leads us to investigate the effect of applying the transformation 
equations (4.7), (4.8) to the expression a,b, + a,6,. The algebra is 
quite simple and so we shall carry out the work in detail, thus a,b, + 
a,b, = (la, + ma,)(1b, + mb,) + (-ma, + la,)(-mb, + 1b,) = (1? + m®)a,b, 
+ Qa,b, + 0a,b, + (m? + l*)ayb, : a,b, + aob,. We now have reached the 
remarkable conclusion that a,6, + a,b, = a,6, + a,6, or, in more 
technical language, we have discovered that the expression a,b, + a,6, 
is invariant in form under a rotation of base vectors. The background 
ideas involved here are as follows: Given a plane, a pair of unit- 
orthogonal vectors i,j and a vector @ in the plane, there is a unique 
set of numbers a,, a, such that a = a,t + a,j. The numbers a,, a, were 


named the components of a. If we draw a new set of base vectors t, J 
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(unit-orthogonal), then as before a can be represented in terms of 1, 


j, thus, a= a,i + a,j. Thus a change in the set of base vectors induces 
a change in the components of a vector. The actual relationship between 
the two sets of components in the case examined (j = it, j = it rataer 
than j = -it) is given by equation (4.7). Since a is any vector in the 
plane, similar remarks apply to the representation of any second vector 
5, The sum of the products of corresponding components of two vectors 
a,6 (namely, a,b, + a,b,) however does not change but is invariant. 


Interpretation of the invariant a,a, + b,b,. Since the sum of the 
products of corresponding components of two given vectors 2 and / in 
the plane at hand has a value which does not change when we change the 
base vectors, we are free to simplify the computation by taking a special 
choice of base vectors. Obviously, if we take # along a then a = at 
(a is the length of a) and therefore a, = a, a, = 0. Also, if (a,8) 
denotes the angle generated hy the shortest rotation of a into 4, then 
by definition the cosine and sine of (a,/) are merely 6,/b and b,/b if 
b,, 6, and 6 denote respectively, the coordinates of the end point of 
8 when © is drawn from the origin, and the length of 4. The coordinates 
b,, 6, are also the components b,, 6, of &, thus the components of & 
are given by the equations 6, = 6 cos(a,S), b, = 6 sin(a,S). Con- 
sequently, we have a,b, + a,b, =ab cos(a,$) +0-b sin(a,) = ab cos(a,f). 

5. The dot product. The main features of the so called dot product 
of two vectors a and / are readily deducible from the extended rel ation- 
ship 


(5.1) a*8 = ab cos(a,f) = a,b, + a,b,. 


Here a*6 (read a dot &) is a new symbol defined by the first relation- 
ship. The numbers a,, a,; 6,, 6, as in the preceding section are 
the components of a and £ relative to a set of base vectors i,j in 
a plane containing @ and £. By assumption a vector (unrestricted) can 
be displaced from one point to another without change. Now since a°f 
by definition is ab cos(a,8), to know the meaning of the dot product 
it is sufficient to understand the symbols appearing in the product 
ab cos(a,8). Obviously, the dot product is a process which produces 
a single real number from a pair of vectors a and &, but there are 
of course infinitely many such processes (there are infinitely many 
functions of the angle (2,6) for example) and so two pertinent questions 
arise at once. Why emphasize this particular process by dignifying it 
with a special name and why call it a product? With regard to the latter 
query, it is customary to call a process which produces a unique entity 
from a given pair of entities x and y a product if it is distributive; 
i.e., if x(y + z) = xy + xz, this of course implies that the operation, 
or operations, + occurring here have already been defined. Hence the 
central question with regard to the propriety of calling a*B a product 
is this: Are the two real numbers a*(f + y) and a*f + a*y equal? The 
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key case is that in which Y lies in the a,5-plane. One method of attack 
is to prove the desired relationship by means of a figure. An alternative 
procedure is to use the equality a*/ = a,b, + a,b, and attack the problem 
algebraically. We shall adopt the latter procedure and denote the com- 
ponents of Y by c, and c, so that 6 + y = b,i + baj + c,i + ca = 
(b, + c,)i + (b, + c,)j. Thus from the fact that the dot product of two 


vectors is equal to the sum of the products of their corresponding 
components, we have 


¢,*a'po+ ary, 


a+(B+y) =a,(b, +c,) +a,(b, +c.) =a,b, +a,6,¢a.c, + a,c, 


2 
As a matter of fact, the fortunate circumstance that a*f is expressible 
as the sum of the simple products a,b, and a,b, assures us at once that 


it (a*6) has many of the properties of real numbers, and points the way 
to easy but important generalizations involving more dimensions. For 
example, since the order of the factors in the mltiplication of real 
numbers is immaterial, we have a*f = a,b, + a,b, = ba, + b,a, = fra, 
Simlarly, since the multiplication of real numbers is associative and 
distributive we have x(a,b, + a,b,) = (xa,)b, + (xa,)b, = a,(xb,)+ 
a,(xb,) and therefore x(a‘) = (xa)*6 = a(xf). 


On the basis of this evidence it is obviously justifiable to call 
the dot process a product. There remains, however, the question of its 
relative importance. For the present, perhaps it will suffice to say 
that it provides a very powerful method for the investigation of certain 
matters of a geometrical or physical nature and that the natural general- 
izations of the expression a,b, + a,b, play a conspicuous part in various 


higher dimensional geometries and physical theories. As an indication 
of the applicability of this product to elementary geometry and physics, 
we note that if a is a unit vector, then a‘*f is equal to the signed 
projection of & on to a. On the other hand, if a represents the dis- 
placement of a particle while acted on by a constant force 4, a*f is 
the work done on the particle by the force during the displacement. 
Projection and work are both concepts of cardinal importance. Perhaps 
it should be emphasized at this point that the vanishing of the dot 
product means that either one or both of the vectors vanish or that 
the vectors are at right angles. 

Before considering a specific application of the dot product, it 
will be helpful to observe that because the distributive and commtative 
laws hold, the products of sums and differences of vectors may be expanded 
as in ordinary algebra. To illustrate: 


(5.2) (a - £)*(a - 6) = (a - B)ra + (a - B)°h = 
at(a - 8) + p(a - B) = a*® - 2ab cos(a,&) + b?. 


Here we have used the fact that aa = a* and 6:8 = b* (since a and b 
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denote the magnitudes of a and 4), and cos 0 = 1). 

The law of cosines. An insight into the power, simplicity, and 
naturalness of vector methods may be obtained by discovering and proving 
the celebrated law of cosines, one of the most important theorems of 
elementary mathematics. The problem is to discover and establish a 
relationship between the lengths of the sides of a triangle. As a first 
step in the investigation, we draw a triangle and convert the sides 
into vectors by introducing arrowheads. We let two of the vectors say 
a and # issue from a common point so that the third vector Y will be 
equal to their difference. We denote the magnitudes of these vectors 
by a, 6, c and denote by the corresponding capitals the angles opposite 
these sides. Since y*Y = c* equation (5.2) may be rewritten as c* = 
a® + b* - 2ab cos C, which is the desired relation. 


y=oc -8 








Fig. (5.1) 


6. The equations of certain simple loci. Since a vector or directed 
stroke is itself a piece of a line, it is to be expected that the simplest 
and most intuitive equations of straight lines are the vector equations. 
As a first step in developing these equations we introduce the radius 
vector ©. This particular vector, it will be recalled, has its origin 
or initial point anchored at a certain given point O which will usually 
be taken as the origin of coordinates if a coordinate system is sub- 
sequently introduced. It is required in general that P be made to 
rotate and vary in magnitude in such a way that its vertex or terminal 
point will trace out the locus in question. The central problem in 
developing the vector equation of a locus is to find the conditions 
that © must satisfy in order that its end point may pick up all of the 
points of the locus and no others. 

Line through a given point ina given direction. As a first illus- 
tration, we present the equation of the line through the origin and 
having the direction of a given vector 4. Evidently, this equation is 
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simply 9 = uf with the proviso that u be a real variable of unrestricted 
magnitude. It will be recalled that for u positive, uf is a vector 
having the direction of & but possessing a magnitude equal to u times 
the magnitude of 4; while for u negative, the direction of uf is opposite 
to that of &. Thus we see that the role of the multiplier u is to stretch 
and contract and in addition to effect a reversal whenever it is negative. 
Thus since the origin of 9? is confined to the point O, the equation 
P = uf restricts 9 to the infinite collection of vectors obtainable by 
drawing & from O and then stretching and contracting / and its reversal. 
Thus for u = 1, ( is the vector 4 and the point determined is the end 
point of £. The point determined by u = 2 is the end point of 2/, 
while u = % is correlated to the midpoint of 8, & being drawn from the 
the origin. Similarly, u = -1 yields the end point of the reversal of A. 

In a like manner the equation of the line through a given point A 
and parallel to a given vector & is simply 


(6.1) p=a+uB 


if a is the fixed radius vector OA. The idea here is simply that all 
points on the given line and only these points may be obtained by stepping 
from the origin to A and then stepping various distances in the direction 
of & and -§, Here a serves to take us from the origin up to the line, 
while & acts as a pointer to indicate the direction of the line. 





Fig. (6.1) 


The normal equation of a plane. As a final application of the dot 
product, we develop the vector equation of the plane through the end 
of a given radius vector 5 and perpendicular to . As before, we let 
P be the variable radius vector whose vertex generates the locus in 
question—in this case the plane. Now since ( and ® both terminate in 
the plane, P ~ 5 lies in the plane and is therefore perpendicular to 6, 
thus we have at once 
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(12.7) (p - 6) *3 = 0, 


To visualize the geometrical background for this equation we may take 
for the given plane a table top and let 5 be one of the legs so that the 
origin is on the floor, then since every line on the table top is per- 
pendicular to the leg we see that 9 ~ © which lies on the table top 
must be perpendicular to ®. The vector ( is a variable vector which 
begins at the bottom of ® and ends in the table top. 


7. To more general theories and higher dimensionality. We have 
assumed that relative to a given set of base vectors, each vector has 
a unique set of components and conversely that each set of components 
determines a unique vector. The introduction of the components in a 
sense achieves an arithmetization of the geometrical concept vector. 
Consequently, we can think and work either in terms of a vector as a 
geometric entity (a directed line segment or stroke) or in terms of its 
arithmetical representation - the collection of its components. 

Components versus vectors. For some purposes (such as the problem 
of deriving the equation of a line through a given point and parallel 
to a given vector), it seems to the writer to be better to work directly 
with the vectors themselves as geometric entities rather than with their 
components. On the other hand, certain key expressions in the components 
such as a,b, + a,b, suggest very clearly how we may create certain 


higher dimensional geometries that will constitute satisfying general- 
izations of the familiar two and three dimensional Euclidean geometries. 

As a matter of fact, a variety of fruitful questions emerge from 
consideration of the components such as: Is it possible satisfactorily 
to attach a meaning to the term components of a vector relative to a 
cvrvilinear coordinate system (i.e., one in which the curves obtained 
by holding all of the coordinates but one fixed and allowing that one 
to vary are not straight lines in all cases)? Assuming this problem 
to be solved satisfactorily, what are the properties of the transformation 
equations connecting the components in two different coordinate systems 
of the more general type? Can systematic procedures be devised for the 
construction of invariants relative to this wider group of coordinate 
transformations? Can a satisfactory two dimensional vector analysis 
be built up out of the collection of vectors tangent to a surface such as 
the surface of a sphere? Such questions may lead to extensive mathematical 
developments and perhaps shed some light on the way mathematics grows. 

Multiple algebra. If in the two dimensional case, we center our 
attention on the two components (a,, a,) of a given vector @ it is a 


natural step, in retrospect at least, to attempt the formation of 
an algebra of number pairs. The facts relating to the components of 
vectors suggest the following definitions (a,, a,) + (b,;, 6,) = (a, + 
by, a, + b,); x(a,, a,) = (xa,, xa,). The first of these interpreted 
vectorially means that the components of a + & are a, + b,, a, + by - 


the sums of the corresponding components. With regard to mltiplication, 
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from our point of view only distributive processes are admissible as 
products. If we define the product of two number pairs (a,, a,) and 
(b,, b,) by the equation (a,, a,)(b,, b,) = a,b, + a,b, then the theory 
of course duplicates our two dimensional vector algebra. It should be 
observed, however, that multiple algebra has other possibilities. For 
example, the complex number a, + ia, can be represented by the number 
pair (a,, a,). Since (a, + ia,) + (b, + ib,) is defined to be a, + b, 
+ i(a, + b,), we have as before (a,, a,) + (b,, 6,) = (a, + b,, a, + b,). 
However, a new product (a,, a,)(b,, 6,) must now be introduced de fined 
as the number pair (a,b, - a,6,, a,b, + a,b,) to correspond to the 
definition (a, + ta,)(b, + ib,) = a,b, - a,b, + i(a,b, + a,b,). The 
number pairs (a, o) and (o, 6) are of course identified with the real 
number @ and the pure imaginary tb. The originally musterious number 
Y-1 now appears as the simple number pair (0,1) and (0,1)(0,1) = (-1,0) = 
-1. Complex numbers are frequently represented by vectors in the complex 
plane. The multiple algebra viewpoint brings out the similarities and 
differences between these two vector algebras—that of Gibbs and that of 
complex numbers. On the algebraic side both of the parent theories of 
Gibbs’ vector analysis, the theory of quaternions and the Ausdehnungslehre 
are multiple algebras. 

Higher dimensionality. Our next concern is to examine the problem of 
extending the notion dot product to three and higher dimensions. To make 
the step to three dimensions, we assume as in the section on Cartesian 
coordinates in space that we have given three mutually orthogonal unit 
vectors which we now denote by t,, i,, t, rather than by three different 
letters. By assumption, a given vector 2 has a unique representation of 
the form a = a,i, + a,i, + @,i,. The numbers a,, @,, a, are, of course, 
called the somponents of a relative to the given base vectors. Thus 
associated with each vector a there is a number triple (a,, a,, a,). 
If we proceed from the standpoint of multiple algebra, we are led as in 
the two-dimensional case to the definitions (a,, a,, a,) + (b;, 6, bg) 
= (a, + b,, a, + by, a, + bz) and x(a,, a,, a,) = (xa,, xa,, xa,). 
Obviously, the most direct generalization of the double algebra equivalent 
of the dot product, namely, the product defined by (@,, a,)(b,, 65) 
= a,b, + a,b, is the product defined by the equation (a,, a, a,)(6,, 
b,, 6,) = a,b, + a,b, + a,b,. If we take this to be the expression for 
a*8 in terms of the components relative to i,, i,, i,, then a*/ will 
be distributive and a‘a will reduce to (a,)? + (a,)? + (a,)*, the sum 
of the squares of the components. This last expression is the square of 
the length of a in Euclidean geometry since a, (a = a,t, + at, + a,t,) 
is a diagonal of a rectangular box of sides a,, a,, a,. Incidentally, 
if we consider successively the vector a,i,, then the vector a,t, + a,t, 
obtained by adding a,i, to the vertex of a,i,, and finally the vector 
a,i, + a,i, + a,t, obtained by adding a,i, to the preceding vector, we 
see that the high hurdle in computing their magnitudes occurs in the 
two-dimensionzl case. The length squared of the first vector is simply 
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(a,)*, while the length of the second vector is the length of the 
hypotenuse of a right triangle, and hence the theorem of Pythagoras 


is needed to show that la, i, + a,i,|? = (a,)? + ta7", However, we 


don’t need a new theorem to prove that la, i, + a,t, + a,i,|* 2 (a,)* + 
(a,)? + (a,)?. The reason is that the added vector a,i, is perpendicular 
to every vector in the (t,,t,)-plane, and therefore (a,i, + a,t,), taken 
as a Single vector, and a,i, form a right triangle. The square of the 
length of one leg is (a,)? + —, while the square of the length of 
the other is (a,)?, consequently, the square of the hypotenuse is (a, )? + 
(a, )? + (a,)?. If we take the next step and postulate the existence of 


a fourth vector l, which is perpendicular to each of t.. t., i. and to 


3 
every vector expressible in terms of them, then the vector a,t, should 


be regarded as perpendicular to (a,i, + a,i, + a,i,) considered as a 
single vector and it would seem that the theorem of Pythagoras is again 


ee 
applicable and if so it would give la, i, + a,t, a,t, + a,i,| = 


(a,)* + (a,)? + (a,)? + tai: This of course is a matter of pure 
supposition. In special relativity there is a consistent four dimensional 
geometry and the key expression may be written in the form (a,)? + 
(a,)* + (a,)* - (a,)?. Thus there is not just one four dimensional 
geometry but many such. It is of course necessary to build up a higher 
dimensional geometry carefully from a self-consistent postulational 
basis. In the writer’s opinion, it is better to proceed algebraically 
rather than by means of situations existing in two and three dimensional 
geometry. One might inadvertently assume, say, that if two planes have a 
point in common then they have at least a line in common. It isn’t 
necessary to investigate higher dimensional geometry in detail to feel 
uneasy about this assumption provided we look in the right direction. 
To illustrate let us set 9 = xt, + yt, + zt, + wi,, define the coordinate 
planes in the usual way, and then examine the vector equations for 
the (x,y)- and (z,w)-planes, namely 


p= xi, + yi, + Oi, + Oi,, = Oi, + Oi, + zig + wig. 


Equating corresponding components, we get as the one and only common 
point, the origin x = 0, y= 0, z= 0, w= 0. Again, if we let x, y, z 
be the coordinates in ordinary Euclidean space and let t denote the 
time, we may construct as in the theory of relativity a four dimensional 
manifold by taking point to mean the abstraction correlated to x,y,z,t - 
a location at a definite instant. If we set z= 0, t = 0, with x and y 
variable, we get the (x,y)-plane of our four-space, which from the 
three-dimensional viewpoint is the ordinary x,y-plane at the instant 
t = 0 only. The (z,t)-plane consists of the set of all points on the 
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z-axis, not for just one particular value of t, but for all values of ¢. 
The common ground of the (x, y)- and (z, t)-planes is again the “origin” 
x=0, y= 0,220, t= 0. 

Granting the need for careful procedure in the construction of higher 
dimensional geometries, it is urgent that such a concept as say or- 
thogonality unless it is selected as a basic undefined term should be 
defined explicitly. Orthogonality is more complicated in three dimensions 
than in two. The algebraic approach provides an easy way to precise 
definition. One way to proceed is to adopt a generalization of the dot 

roduct by generalizing the key expression a,b, + a,b, and then define 
Val, orthogonality of a and 4, cos(a, 4) through the generalization 
adopted, thus: |a| = Vara, orthogonality of a and 6 means |a| 7 0, 
| B| F 0, a*B= 0; cos(a, 6) = a:A/(\al|4]), fora 70, FO. 

The generalization, suggested by multiple algebra, of a,6, + a,6, is 
easily verified in the case of ordinary Euclidean 3-space with a set 
of base vectors i,, t,, t,. Since a ~ 5 = (a, — b,)i, + (a, - ba) ia 
+ (a, - 6,) i, it follows that la - Al? = (a, - b,)? + (a, - b,)* + 
(a, - b,)* = (a,)? + (a,)* + (a,)? + (b,)? + (b,)? + (b4)* - 2a,b, 
+ a,b, + agb,) = lal? + jal? - 2(a,6, + a,b, + a,6,). But since the 
vectors in the dot product (a - £)*(a ~ 8) can all be contained in a 
single lane, we have |a - 6|? = (a - 8)*(a - 8) = aca + Bf - 2a B= 
la| + [B|? - 2a°B, Comparison of these two expressions gives the desired 
relationship. 


a*6 = a,b, + a,b, + a,b,. 


8, The cross product. Just as the vanishing of the dot product implies 
orthogonality, there is a second product whose vanishing expresses 
parallelism—it being supposed, of course, that the vectors themselves are 
not of zero length. The parallelism of two vectors a and & beans that 
a= k6 or a, = kb,, a, = kb,, a, = kb, or a,/b, * a,/b, = a,/b,. By 
cross multiplying we get the following three equations a,b, - a,b, = 0, 
a,b, - a,b, = 0, a,b, - a,b, = 0. The left members of these equations 
are differences of products of components of a and / and, furthermore, 
there are three of them—just the number of components of a vector in 
three dimensional space. Thus we are led to consider the following 
vector expression: 


(8.1) i,(a,6, - a,b.) + i,(a,b, ~ a,b.) + i, (a,b, - a,b,). 


We should note here that the order of the subscripts in the positive 
terms is in agreement with the cyclic order 1, 2, 3, 1, 2, thus 1, 2, 3; 
2, 3, 1; 3, 1, 2. If we replace the components of / in the vector 
expression (8.1) with the components of /£ + Y, i.e., if we replace }, 
with 6, + ¢,, b, with b, + c,, 6, with b, + c,, as we shall do presently, 








spoRnm mr OG 


= ogo a & 


ee ee ee ee en ee 








ve 
3; 


or 


y; 





1951) VECTOR ANALYSIS 85 


it will become apparent that this vector expression which is constructed 
out of the components of the two given vectors has a distributive 
property. Hence (8.1) can be regarded as a product of the two vectors 
a and £, If the base set is right-handed (this means that i, has the 
direction of the thumb of the right hand when the curled fingers indicate 
the shortest rotation of i, into i,) then it turns out that (8.1) is 
the so called cross product of a and 4, We shall denote the cross product 
of a and & by the symbol ax and abbreviate (8.1) by just writing the 
first term i(a,b, - a,b,) + ***. The usual definition of the cross 


product, which may be shown to be equivalent to that above, is as follows: 
Definition (8.1) a x & = ablsin(a, 8)|v. 


Here a and 6 denote the magnitudes of a and # and v is a unit vector 
perpendicular to a and # and so directed that a, 4, v will constitute 
a right-handed system. If a and / are parallel or zero and hence do not 
determine ’, the product is assigned the value zero. 

Carrying out the details of the check for the distributive property, 
we have 


ax (B+ y) = i,[a,(b, + c,) - a,(b, + ¢,))] + °° 
* i, (a,6,- a,b.) + *** * t,(a,e,-a,c,) + *** = axPr axy, 


Unlike the dot product the cross product is not commutative but anti- 
commutative, this means that ax 6 = -6xa. This fact can be verified 
easily from the definition or by interchanging the order of the letters 
a and b in the expression (8.1). Also the cross product doesn’t admit 
of easy generalization to higher order spaces - drastic revision is 
necessary. Like the dot product it is associative with respect to 
multiplication by a scalar s in the sense that 


s(ax8) = (sa) x8 = ax (s§) 


The cross product finds wide and effective application in geometry 
and classical physics (particularly in mechanics and electricity and 
magnetism). Such fundamental concepts as the moments of force and momentum 
about a point 0 have their simplest expression in terms of the cross 
product. 

The higher products. Of the products involving more than two factors 
the most fundamental are the scalar triple product a*(8xy) and the 
vector triple product ax (xy). The scalar triple product has the 
following properties: (a) it is a scalar, (b) it is unaltered by an 
interchange of dot and cross (the vectors adjacent to the cross are to 
be grouped together), (c) its absolute value is equal to the volume of 
the parallelopiped determined by the vectors, (d) its vanishing means 
that one of the vectors is expressible in terms of the other two. This 
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product is fundamental in the step from unit orthogonal base vectors to 
the more general sets whose members are not orthogonal and not necessarily 
of equal length. If a, 8, y is such a set and a*(f x y) 7 0, then any 
vector ( is expressible in the form P = m + y{ + zy. To solve for x, 
y, z (the components of ( relative toa, §, y and likewise the coordinates 
of its end point in the new coordinate system when drawn from the 
origin), we multiply (dot product) by a**, 4*+, y**, these new vectors 
being defined as follows: a* = 8 x y/s, B* = y x a/s, y* = a x GB/s, 
s = a‘(8 x y). The values of x, y, z turn out to be Prat, p*h*, pry* 
respectively. 

The outstanding property of the vector triple product is its expansion 
formula, namely, a x (8 x y) = (a*y)6 - (a*S)y. Thus a x (6 x Y) is a 
vector expressible in terms of / and Y. 

9. The calculus of vectors. It is impossible to give an adequate 
account of the differential and integral calculus of vectors without 
assuming an extensive knowledge of the calculus of real variables; how- 
ever, a few brief remarks should suffice to give an idea of the flavor 
of this part of vector theory. The derivative of a vector function 
P(t) with respect to t turns out to be a piece of the tangent line to 
the curve ( = A(t). The definition of the derivative follows the pattern 
of the definition of dy/dx, i.e., do/dt = lim/p/At, 4o= p(t, + At) -p(t,). 
To illustrate, © = costi +sintjis the vector equation of a circle 
and here ( is a function of t. The derivative d?/dt is -sin ti + cos tj, 
a vector perpendicular to the radius vector costi + sintj (since 
P-dp/dt = 0) and therefore tangent to the circle. If the radius vector 
P is expressed as a function of arc length s along the curve, then 
dp/ds is a unit tangent vector, d*p/ds* is perpendicular to the curve 
in what is called the principal normal direction, and | d*p/ds*| is 
the curvature. Vector methods are ideal for the exposition of much of 
the differential geometry of curves and surfaces as well as for the 
discussion of mechanics, electromagnetism and other branches of physics. 


The University of Texas 
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EXTERIOR BALLISTICS 


John W. Green 


Two of the greatest English mathematicians of all times are two who 
have been associated over the past three or four decades, co-authoring 
over a hundred mathematical papers and setting a standard for mathematical 
analysis so severe that there almost seems to be none worthy of following 
in their footsteps. These are G. H. Hardy and J. E. Littlewood. 

Now Hardy is said to have once said of Littlewood, of whose technical 
facility he had the greatest respect, ‘‘Littlewood tried to make a decent 
science out of ballistics, and if Littlewood could not, who could?” 

There is unfortunately a certain amount of truth suggested by Hardy’s 
remark. I do not claim that ballistics is an indecent science, but it 
has to be admitted that it is an unbeautiful one, compared to many other 
branches of mathematics. It is applied mathematics at its extreme. One 
is not allowed to select problems to work on, but they are forced upon 
one; one does not follow up those aspects of a problem which seem 
attractive, but those which lead to an answer. There are a few exceptions 
to this - such as Gronwall’s interesting work on the qualitative pro- 
perties of the trajectory, but these are largely out of the domain of 
ballistics and into that of fancy. So far as I know, no bit of new 
basic mathematical knowledge has come from ballistics. However, into 
it vast quantities of knowledge obtained and techniques developed in 
other branches of mathematics have been poured. 

Neither do I claim that ballistics is uninteresting. It can be, and 
frequently is interesting; however it is usually the subject matter of 
the physical problem which is the interesting thing, not the mathematics. 
There is a certain amount of glamour and morbid fascination to the 
subject. 

Now up until fairly recently, most of the principal contributions 
to ballistics have come, not from professional mathematicians but from 
ordnance officers and astronomers. (An exception is Bashforth, an early 
British ballistician and clergyman!) The interest of the former in 
ballistics is obvious, and practically all the work up to 1914 is due 
to French, German, and Italian officers. These made many clever advances 
in the subject, and the theory developed by them was quite adequate 
for the weapons used before World War I. 

The appearance of the astronomers on the scene date to 1914 or there- 
abouts, and is a manifestation of the phenomenon of total mobilization 
of resources which war demands these days. The astronomers have four 
important qualifications which make them successful ballisticians. First, 
they have adequate backgrounds in the laws of physics and mechanics to 
handle the physical problems involved. Second, they are adept at numerical 
processes and are capable and energetic computers. Third, they have 
developed in astronomy techniques of pertubation calculations that lend 
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themselves ideally to ballistic problems. Finally, and perhaps most 
important, they are able to construct and operate the fine optical and 
other equipment which is essential to ballistic instrumentation. As a 
result, in recent times of emergency, the astronomers have tumed almost 
en masse to ballistics. 

During World War I, numerous mathem:ticians also turned to ballistics 
and made important contributions, principally in numerical methods of 
trajectory computing. More and more mathematicians have been turning 
to the subject in recent years, particularly as the distinction be tween 
ballistics and aerodynamics gradually breaks down and introduces the 
mathematically more difficult problems of aerodynamics. At the Aberdeen 
Proving Ground, the Army’s principal ballistic establishment, there was 
in World War II possibly the largest collection of mathematicians ever 
collected for an appreciable time. 

Now what does one study in ballistics anyway? I should say that I 
am restricting myself to exterior ballistics, which deals with the 
behavior of the projectile from gun muzzle to striking point. Interior 
ballistics, which deals with what happens in the gun barrel, and terminal 
ballistics, which deals with the damaging possibilities on striking, are 
quite different matters. 

Exterior ballistics (before rockets, at least) consists of the study 
of the motion of an unpropelled projectile under the action of whatever 
forces act on it. These forces are the force of gravity and the aero- 
dynamical forces of the atmosphere through which the projectile passes. 
The former is, of course, easily determined. It is proportional to the 
mass of the projectile and toward the center of the earth. Right here 
is a good place to introduce one of the most important ballistic facts 
of life - namely that there is no ballistic problem approximating a 
physical situation which is sufficiently simple to solve exactly. The 
keywords in ballistics are, simplify - approximate - idealize. So almost 
always the force of gravity is assumed to be constant in magnitude and 
direction instead of obeying Newton’s beautiful law. Not that one cannot 
solve dynamic problems with the exact gravitational law; one can, but 
taken in conjunction with the other complications usually present in 
ballistic problems, this refinement costs too much and is usually ignored. 
But more of this slightly later. 

Now when we turn to the second kind of force the matter is quite 
different. The motion of the atmosphere in the neighborhood of a pro- 
jectile is very complicated, and to discover what it is, one has very 
difficult partial differential equations to solve. Yet if one intends 
to find by theory and not by experiment the forces on a projectile, one 
must solve the equations. No one* has ever found by this method the 
forces on any projectile which would agree with those experimentally 
determined well enough to allow their use. What is done? One measures 
the force experimentally. This may be done in many ways, the use of 


*With the possible exceptions of the sphere and cone (infinite). 
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wind tunnels being the most obvious. That is, we hang the projectile in a 
wind tunnel and measure the force on it as a function of wind velocity. 
Now the system of force is very complicated. The principal force is 
called the drag and is directed opposite to the direction of motion. How- 
ever the projectile does not always point in the direction it is going, 
and so a transverse force, or lift, results. If the projectile is 
spinning, other forces act. However, what is usually done as a first 
approximation is to ignore all forces but the drag force. It is at this 





stage of approximation that ballistics proper sets in. That is, the 
study of the motion of a projectile is actually a difficult aero- 
dynamic problem. However, if all the forces are determined experimentally 
and synthesized into a single system of force depending on the velocity 
of the projectile, the resulting equations are ordinary differential 
equations instead of partial, and the study is called ballistics. 

Now the equations above have been the cause of much -back-breaking 
labor and the matter of the discovery of many clever ways of approximate 
solution—particularly by the early Italian and French ballisticians. They 
are not usually susceptible of solution in terms of elementary functions, 
and the modern method consists mainly of the strong-arm method of 
numerical step-by-step integration. There have been discovered three ways 
of effectively carrying this out by devices and machines of one sort or 
another. The first is the young lady named Mildred, or the like, who 
computes on the desk computer and grinds out a trajectory in perhaps 
12-48 hours. The second is the differential analyzer, which does it in 
a few seconds. In spite of these figures, the first is one of the best. 

Now when the trajectory is computed, one knows where the projectile 
will land. It is, however, one of the bitterest disappointments to those 
studying ballistics that this is not the way it is usually done. The 
military is decidedly averse to accepting the ballisticians’ prediction 
as to where the projectile will land. (Aren’t mathematicians notoriously 
careless with decimal points and such things?) In fact, the mlitary is 
disgustingly pragmatic in these matters and has always insisted on shoot- 
ing the gun and measuring the range to see where the projectile goes. 
If this is the case, where does the ballistician fit in, if at all? 
The answers: lie principally in the fact that it is not feasible to make 
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a sufficient number of experiments to determine the behavior of the 
projectile under all conditions where it might be used. Actually what 
is done is usually this: A few firings are made, the drag function 
estimated from the results, and the other trajectories computed on the 
basis of this drag function. 

Also, of course, in the case of bombs dropped from airplanes where 
the initial data are so hard to determine, and in anti-aircraft firing, 
where the final data are difficult to determine experimentally, the 
ballistician becomes even more essential. 

One of the most important problems which the mathematical ballistician 
has to solve is that of effects of small changes in conditions of one 
sort or another. Take for example the effect of the non-constancy of 
gravity and the rotation of the earth. The effect of these and many other 
similar non-standard conditions can be computed with the use of suitable 
mathematical analysis of certain differential equations related to the 
equation of the trajectory. Now when would one use, say, a correction 
for the rotation of the earth? This correction is not seriously large 
except on the very longest trajectories, and is very complicated to 
apply, depending as it does on the angle of fire, the initial velocity, 
the type of projectile, the latitude, and the direction of fire. Where 
it is used is in the reduction of the range firings which are carried 
out to make the firing table. A 40-foot error due to earth rotation 
may not be a bad thing to tolerate, particularly if the error is random 
due to firing in different directions. But if this were ignored in the 
range firings, it would prejudice every trajectory by this 40 feet and 
always in the same direction.* 

Now up until the last war, the most complicated problems which the 
ballistician had to solve were those connected with the spinning shell— 
and this is no simple problem. It is treated like a particle, plus 
correction for the fact that it isn’t. The gyroscopic effect of the spin 
has to be taken into account. But in the last war, new and more compli- 
cated problems arose. The first of these I mention are those in connection 
with the flight of bombs. Now a bomb trajectory is really just the last 
half of an artillery trajectory, so that there is nothing new there. 
However, the initial velocities are very small (400 feet/sec.), and the 
method of launching is such as frequently to introduce large yaws. (The 
yaw is the angle between the direction the projectile is going and the 
direction it is pointing.) Besides, bombs are usually light for their 
size, compared to shells, since they consist of mostly explosive and 
little steel. As a result, the force system is far from reducing to 
a simple drag. There is a considerable lift force also. Furthermore, 
the yaw does not remain in the same direction; it oscillates back and 
forth about the zero position. The result is a rather complicated analysis 


*Who says the direction is random? Not in a given war, anyway. 
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of differential equations with oscillatory solution, and an increase 
in the excitement in the life of the ballistician. 

The second kinds of new problem I mention are those in connection 
with rockets. Imagine everything as before, but with a force of variable 
strength propelling the rocket in the direction in which it is pointed. 
A very complicated problem, particularly since some of them spin also. 
These new problems give rise to new and more interesting mathematical 
problems on which ballisticians are spending their time. 

There is another place in ballistics where mathematicians and astro- 
nomers especially have been employed, and this is in the making of 
photographs of projectile in flight and measuring these photographs and 
reducing these measurements to physical data on position, velocity, 
and acceleration. The mathematics used has no appreciable connection 
with the theory of exterior Lallistics, and indeed consists mainly of 
difficult problems in solid geometry and trigonometry and computing. 
However, where there is computing, there is mathematics; namely the 
mathematics that provide ways of reducing the amount of computing. 

I have tried to give an idea of some of the problems which ballis- 
ticians encounter. Most of the people engaged in ballistics find their 
work nearly as much engineering as mathematics. In my approximate three 
years as a practicing ballistician, I found myself doing jobs as diverse 
as computing a trajectory on a desk computing machine, organizing fairly 
large scale trajectory computing projects for other computers, preparing 
bombing tables, designing a bomb-sight cam to build the firing table 
into the sight, and designing a fitting by which to hang a 1, 000-pound 
rocket to swing so that its moment of inertia could be found. 

Unfortunately ballistics seems to be here to stay, and employment 
possibilities in it are very good at the present. I hope we all live to 
see the day when it will exist no longer at all as a branch at least of 
applied mathematics. By the time of that millenium, perhaps someone more 
able than Littlewood will convert it into a respectable member of the 
family of branches of pure mathematics. 


University of California, Los Angeles 
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GAMES OF STRATEGY 


Melvin Dresher 


Introduction. We shall review briefly a relatively new branch of 
mathematics — the theory of games of strategy. This includes parlor 
games as chess, bridge, poker; economic games as the production and 
pricing of goods, and military games as defense against attack and timing 
of attack. In such games the players can influence the outcome of the 
game by outwitting each other. We shall omit games of chance, such as 
roulette, whose outcome depends entirely on chance and cannot be in- 
fluenced by the actions of the players except by cheating. 

Games of chance have been studied mathematically for many years, and 
the mathematical theory of probability was developed from their study. 
However, the problems of games of strategy were hardly considered until 
1928 when von Neumann wrote the first mathematical paper on the subject. 
This paper was later expanded jointly with Morgenstern in their book, 
Theory of Games and Economic Behavior, published in 1945. 

Von Neumann defined a game very generally. It is a set of rules which 
describe what moves each player may or must make under all possible 
circumstances, what moves are made for the players by chance, what 
information is available to the players and what payments are made to 
the players when the game is over. 

Initially, each game is described in terms of its moves, which may 
be personal (the moves are made by the decision of the players), or 
chance (the moves are made for the players by some random device). The 
first simplification of the description of a game is the introduction 
of the concept of a strategy. In the actual play cf a game, each player, 
instead of making his decision at each move of the game, may formulate 
a complete plan for playing the game from beginning to end, covering 
all possible contingencies. Such a plan is referred to as a strategy. 
The plan or strategy includes any information that may become available 
to the player in accordance with the rules of the game. In this way 
no freedom of action is lost through the use of a strategy since a 
strategy specifies the player’s action in terms of the information that 
might become available. In effect, a player selects a strategy and the 
strategy plays the game. 

Each possible way that a player has for playing a game from beginning 
to end is a strategy for that player. If we enumerate all his possible 
ways of playing the game, we shall obtain all the strategies for that 
player. Every combination of strategies, one from each player, will 
determine an outcome of the game which is measured by the payoff to the 
player. The totality of payoffs to a player can be represented by a 
payoff matrix. 

The fundamental problem of the theory of games of strategy is to 
determine for each player his best strategies, or the best ways to 
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play a given game from beginning to end. We shall discuss only games 
having two players and where the winnings of one player represents the 
losses of the other. All parlor games are of this type. 


Two Person “le Her”. We can illustrate the concepts of the previous 
section by means of the 18th century card game “‘le Her”. Such famous 
mathematicians as N. Bernoulli and Montmort attempted to solve this 
game but failed. Although the game was played by several persons, we 
shall describe it for two players. A dealer gives, at random, one card 
to a receiver and one to himself. The object of the game is to obtain 
the higher card. If the receiver is dissatisfied with his card he may 
compel the dealer to change with him, but if the dealer has a king, 
the dealer is allowed to retain it. If the dealer is dissatisfied with 
the card he holds, after the receiver has used his option, the dealer 
may exchange it for another card drawn at random from the deck. However, 
if the card he then draws is a king, he must keep the card with which 
he was dissatisfied. If the dealer and receiver have equal cards, the 
dealer wins. 

This game consists of three moves. The first move is a chance move - 
dealing one card each, at random, to the receiver and dealer. The second 
move is a personal move by the receiver — exchanging his card with the 
dealer or staying with the original card. The third move is a personal 
move by the dealer - exchanging his card with a card from the deck or 
staying with the card he holds. 

The game can be summarized by defining a strategy for the receiver 
to be a determination of change or stay for each of the 13 cards. One 
such strategy might be written as 


2 8.6.0 8672.83.39 B.4..9 5 


c s c Ss Ss c c Ss s c s c s 


where c means change and s means stay. This strategy tells the receiver 
to change if he receives an ace, stay if he receives a two, change if he 
receives a three, ..., change if he receives a queen, stay if he receives 
a king. Note that the strategy is a complete set of instructions. It is 
apparent that the receiver has i strategies. The dealer’s strategy can 
be described in the same way, and therefore he also has tg strategies. 
It is not necessary to compute each payoff in the entire 2'3x2'3 matrix, 
for it is apparent that any strategy in which exactly r changes of cards 
appear is dominated by a strategy in which the first r cards are changes 
and the remaining 13 - r cards are holds. It can be verified that of the 
2'> strategies of each player, 2'* - 2 of them are poor strategies - 
two strategies dominate all the others. Thus after all the poor strategies 
are removed, the game of “le Her” reduces to the following 2X 2 payoff 
matrix 
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PAYOFF TO DEALER 
(Probability of Winning) 
Receiver’s Strategies 


Hold 7 and over Change 7 and under 





16,182 16,122 
Hold 8 and over 33.150 33150 
Dealer’s 
Strategies 
16, 146 16, 182 
Change 8 and under 33.150 33 150 











We shall solve this game in a later section. 

Minimax Theory. The mathematical model of the two-person zero-s um 
game can be described in the following way: Player I chooses a strategy 
t from his m strategies and player II chooses a strategy j from his n 
strategies, each choice being made without any knowledge of the other. 
The payoff to player I is @; and the payoff to player II is “a; 5. Player 
I wishes to maximize a. but he controls only the choice of 1; player 
II wishes to maximize -a,., or to minimize a.. but he controls only the 
choice of j. Any solution of the game should resolve these conflicting 
interests. 

Player I has some strategy, t*, such that he can receive at least 


max min a; 5. Player 'T has some strategy, j*, such that he pays at most 
t J 


min max a... In general, 


J 
j t 
< , > < 
a;;« < min max a;; > max min a;; * ajo, 
J t t J 
for all t and j. If the game is such that max min a;; = min max a, ,, 
i j j i 


then we refer to i*, j* as optimal strategies of player I and II re- 
spectively, and Qieje as the value of the game. The optimal strategies 
have the property that if player I chooses i*, he is certain to receive 
at least Giese and if player II uses j)* he can make sure that Giese 
is the most that I receives. Even if player I were to announce in advance 
that he plans to play strategy i*, player II could not take advantage 
of this information and thereby reduce I’s winnings. A necessary and 
sufficient condition that a game have optimal strategies i*, j)* is that 


there exists an element Gieie of the payoff matrix which is simultaneously 


the minimum of its row and the maximum of its column. 
If the payoff matrix A = ll; ; || is such that 


max min a.. “ min max a, 
2. a 
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then there exists no single strategy that is optimal. In such a game 
a player can pick a best strategy only if he knows the strategy chosen 
by his opponent. But by the rules of the game this is impossible for 
the players must pick their strategies simultaneously. In such a game 
it is also important that a player’s chosen strategy be unknown to his 
opponent. This can be accomplished by mixing the strategies. A player, 
instead of choosing a single strategy, chooses a probability distribution 
over the whole set of strategies and a random device selects a particular 
strategy, subject to the given probability distribution. Such a proba- 
bility distribution is called a mixed strategy. 

The fundamental theorem of games of strategy, first proven by von 
Neumann, states that if each player has a finite number of strategies, 
then each player has an optimal mixed strategy. That is, player I has 
a mixed strategy such that if he uses it he can expect to win at least 
an amount v, and player II has a mixed strategy such that if he uses it 
he can expect to lose no more than the amount v. Thus there exist mixed 
strategies Xx) Yo for players I and II respectively such that 


X'AY, < v = X{AY, < XAY, 


for all mixed strategies X and Y of the two players. The optimal mixed 
strategy also has the property that it may be announced, in advance, and 
an opponent cannot take advantage of this information. 

Solution of “le Her”. Applying the fundamental theorem to the game 
“le Her’’ , we find that the optimal strategy for the dealer is to mix 


his two strategies with probabilities , d, respectively, and for 
the receiver it is to mix his strategies with probabilities (°, 3) The 


value of the game to the dealer is .487 and to the receiver it is .513. 
It is interesting to note that the dealer is at a disadvantage in this 
game, although all the rules seem to be in his favor. This is partially 
explained by the fact that the receiver has the first move and is in a 
more powerful position to influence the outcome of the game. 

Three-Finger “Morra”. A well-known finger game player by two people 
is the game of morra. Each player shows one, two, or three fingers and 
simultaneously guesses the number of fingers his opponent will show. 
If just one player guesses correctly, he wins an amount equal to the 
sum of the fingers shown by himself and his opponent; otherwise the 
game is a draw. The payoff matrix is given by: 
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(1,1) (1,2) (1,3) (2,1) (2,2) (2,3) (3,1) (3,2) (3,3) 
(1,1) 0 2 2 -3 0 0 -4 0 0 


(1,2) -2 0 0 0 3 3 -4 0 0 
(1,3) -2 0 0 -3 0 0 0 4 4 
(2,1) 3 0 3 0 -4 0 0 -5 0 
(2,2) 0 -3 0 4 0 4 0 -5 0 
(2,3) 0 -3 0 0 -4 0 5 0 5 
(3,1) 4 4 0 0 0 -5 0 0 -6 
(3,2) 0 0 -4 5 0 0 0 -6 
(3,3) 0 0 -4 0 0 -5 6 6 0 


This game has an infinite number of optimal mixed strategies. Every 
convex linear combination of the following four mixed strategies is 
an optimal strategy: 


Se és t ek ee ee 

TY 12 oa 7 

(o, 0, 16 o, 22, 0, 2 o, 0) 
37 37 37 

(o, o, 20 o, 15 9, 12 9, 90) 
47 47 47 

(o, 0, 25, 0, 20 9, 16 9 9) 
61 él 61 


Since the game is symmetric, the value of the game is zero. 

Games with Infinite Numbers of Strategies. So far we have assumed 
that each player has a finite number of strategies. Many games have an 
infinite number of strategies. For example, economic games in which the 
strategy consists of determination of a price or a percentage, have 
a continum of strategies. The mathematical model for such a game is 
the following: Player I chooses a strategy x and player II chooses a 
strategy y, each satisfying 0 < x <1, 0 < y < 1. The payoff to player 
I is given by a function M(x,y). Since the game is zero-sum, the payoff 
to player II is -M(x,y). 

Whereas every finite game has an optimal strategy, some infinite 
games do not have optimal strategies. However, if M(x,y) is a continuous 
function of x and y, then each player always has an optimal strategy. 
In general it is very difficult to determine the optimal strategies 
of an infinite game. We give some examples of infinite games and their 
solution. 

Allocation of Forces. Two targets of value | and k are to be de fended 
against attack. How should the defender distribute his total defensive 
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force among the two targets and how should the offense distribute his 


attacking force among the two targets? A convenient payoff to the offense 
can be defined by 


- if x > 
M(x,y) = ‘teil * oe . 


k(x - y) if x<y 


where x is the fraction of the offensive force sent against the target 
of value 1, and y is the fraction of the defensive force placed at that 
target. 

Solution: The offense mixes his strategy - he attacks a single target 
with all his forces but selects the target at random subject to the 
following distribution function 


F(x) = 





l k 
iis) I, (x), 
+ k 1 +k 


where I,(x) is a step-function with unit jump at x = a, 





The defense uses a pure strategy — places of his force at target 





with value l, and of his force at target with value k. 


k 
l+k 
Games of Timing. The theory of games of strategy can be used to solve 
a large class of timing problems. In these problems, the rules of the 
game describe the actions which the players are to take, but the timing 
of the actions is left to the players. The situation is such that each 
player wishes to delay the actions as long as possible but he is also 
penalized for delaying. This conflict of interests can be resolved and 
a best timing of action can be determined for each player. A duel between 
two persons is an exanple of a game of timing. 


1 + 


Expectation: Offense can expect to win 





Game of Timing with Information. Each duelist is informed of his 
opponent’s action, firing, as soon as it takes place. The two duelists 
close in on each other with no opportunity for retreat. The accuracies 
increase steadily. The payoff is 1 to the surviving duelist and 0 if 
both or neither survives. In terms of the accuracies, the payoff is 
given by the following payoff function: 


2P (zs) - 1 ifx<y 
M(x,y) = P,(x) - Po(x) iff x = 
1 - 2P,(y) if x>y. 


where P,(x), P,(y) are the accuracies of the two duelists at time x and 


y respectively. 
Solution: Each duelist fires at x* where P,(x*) + P,(x*) = 1, if his 
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opponent has not fired already. If opponent has fired at x*, then fire 
at x such that P, (x) = 1. 
Expectation: Player I wins P,(x*) - P,(x*). 

Game of Timing without Information. Each duelist is ignorant of any 


firing by the other. Assume that the duelists have the same accuracy. 
functions. Then the payoff function is given by: 


-y¢ (i ¢ yin if a < y 
M(x,y) = 0 ifx=y 
9+ He gme if as 
where x and y are the kill probabilities. 
Solution: Never fire before x = 1. From x = 1 to x = 1, fire with 
3 
density 
1 
dF(x) = —— dx. 
4x3 
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MISCELLANEOUS NOTES 
Edited by 


Charles K. Robbins 


Articles intended for this Department should be sent to Charles K. 
Robbins, Department of Mathematics, Purdue University, Lafayette, Indiana. 


THE LOG LOG SCALES OF THE SLIDE RULE 


In certain computations using the log log scales on a slide rule, 
there may be uncertainty either about the section in which the answer 
is located or about the position of the decimal point. A procedure 
for eliminating these doubts is presented herewith. Pertinent to this 
procedure are the following definitions: 

Definition 1. When the slide extends to the right at the completion 
of an operation, such condition is designated as Status I. 

Definition 2. When the slide extends to the left at the completion 
of an operation, Status II exists. 

Definition 3. The adjusted characteristic of the logarithm of a 
number is simply the characteristic plus unity. Adjustment is made when 
Status II exists. 

The procedure mentioned above is based upon the following rule. This 
rule is expressed with the understanding that the sections LL1, LL2, 
and LL3 are numbered 1, 2, and 3, respectively; and, likewise, LLO1, 
LLO2, and LLO3 are also numbered 1, 2, and 3, respectively. 


Rule. Consider Z = X'. The number of the section on which a power Z 
is found is equal to the number of the section on which the base X is 
found plus the characteristic of the logarithm of the exponent Y (the 
latter being adjusted if required by status). 


Proof: To facilitate the presentation of the proof, the following 
nomenclature is adopted: 

k. = the number of the section bearing the number J; 

+. the characteristic of the logarithm of J; 


M; = the mantissa of the logarithm of J; 


C. = the characteristic of the logarithm of J, adjusted when re- 
quired by the ststus of an operation. 


In terms of these symbols, we wish to prove that for Z = x’, 


OI 


ee 
z x y 
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With reference to Z = X*, suppose X to be greater than unity. This 
is the case where the log log scales greater than one (LL1, LL2, and 
LL3 on some slide rules) are used. Taking the natural logarithm of 
both members, we obtain: 


In Z= (ln X)Y 
Now taking the common logarithm of both sides, we have: 
log(In Z) = log(in X) + log Y 
or, M + C = M + Cc, 


In z nz ln x 


+ M +C (1) 
y 


nz y 


If there is no carryover upon adding M),, and M, (Status I exists), 
then 


M, , * Ms 28 M, (2) 
Substituting (2) in (1): 
C = ¢ + C (3) 
In z In «x y 


If there is a carryover upon adding M, . and M, (Status II exists), 
then 


M + 1 =M + M ( 4) 


ln z In«x y 
Substituting (4) in (1): 
C '¢ +(C +1) (5) 


In z nx y 
We may summarize (3) and (5) in the expression: 
C = C + C (6) 
where C_ is the characteristic of the logarithm of Y, adjusted when 
required by the status of the operation. 


We will now show that a result similar to (6) holds for the case, 
X <1. If X in Z = X’ is less than unity, then In Z and ln X in 


In Z= (ln X)Y 


are both negative. Hence, it is necessary to multiply both members 
by -l, before taking the common logarithm of both sides: 


log(-ln Z) = log(-ln X) + log Y 
Or, C + M =C + M +C +M 
Y 


-lnz -ln z -lnx -Ilnx 


Y 
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From this, following the process used for X > 1, we derive: 


. = C + C¢ (7) 
-lnz -lnx y 
Now consider any number P within range of the log log scales. If we 
take P > 1, we would find it on a section of the log log scales greater 
than one. If P < 1, we would find it on a section of the new log log 
scales less than one (LLO1, LLO2, and LLO3 on some slide rules). By 
consulting a table of natural logarithms, we can show that if 


P is greater than one, Clap = k, ~ Be 


ek - 3. 


P is less than one, C404 " 


Since P was any number, for X and Z, in particular, we have: 


c =k -3 (8) 
ln x x 
C zh «§ (9) 
ln z z 
and, C =k -3 (10) 
-Inx x 
ane se (11) 
-in z z 


Substituting (8) and (9) in (6), and substituting (10) and (11) in 
(7), we obtain a common result, namely: 


s o8. ¢¢ 
z x y 


Now note the following examples: 


Example 1. Evaluate 6.35%. 
Solution: Set the hairline to 6.35 which is on the LL3-scale. Draw the 
left C-index under the hairline. Move the hairline to 300 on the 
C-scale. Since Status I exists, k = 3 + 0 = 3. Thus, the answer, 256, 
is read on the LL3-scale., 


Example 2. Evaluate 2.335°°7°, 
Solution: Set the hairline to 2.335 which is on the LL2-scale. Draw the 
right C-index under the hairline. Move the hairline to 750 on the 
C-scale. Since Status II exists, k = 2 + (-1 + 1) = 2. Thus, the answer, 
1.889, is read on the LL2-scale. 


Example 3. Evaluate 275°:°°%, 
Solution: Set the hairline to 275 on the LL3-scale. Draw the right 


1. It may be helpful to note that in taking the natural logarithm of both 
members of 6.35” = x: (ln 6.35)3 = In x, we have immediate indication that a 
multiplication is to be performed in which 6.35 is to be located on a log log 
scale, 3 on the C-scale, and in which x is to be read from a log log scale. 





104 MATHEMATICS MAGAZINE 





C-index under the hairline. Move the hairline to 400 on the C-scale. 
Since Status II exists, k = 3 + (-3 + 1) = 1. Thus, the answer, 1.0227, 
is read on the LLl-scale. 


Example 4. Evaluate 0.05°:°°'?, 
Solution: Set the hairline to 0.05 on the new log log scales less than 
one, or on the LLO3 scale. Draw the right C-index under the hairline. 
Move the hairline to 612 on the C-scale. Since Status II exists, k = 
3 + (-2 + 1) = 2. Thus, the ansser, 0.832, is read on the proper LL2 
or LLO2 scale under the hairline. 


Example 5. Find x in 17= x‘, 


Solution: We note that x = 17'74. Set the hairline to 17 on the LL3- 
scale. Draw 400 on the C-scale under the hairline. Since division is 
used and since Status II exists, k = 3 - (0 + 1) = 2. Thus, x = 2.03 
since the left C-index is over 2.03 on LL2. 


Example 6. Evaluate 7.3126/4'-5, 
Solution: Set the hairline to 7.31 on the LL3-scale. First draw 415 on 
the C-scale under the hairline. At this stage, Status II exists—there- 
fore, k = 3 = (1 + 1) = 1. Move the hairline to 260 on the C-scale. 
Status II still exists, so k= 1 + (1 + 1) = 3. Thus, the answer, 
3.48, is read on the LL3-scale. 


Note: The problems which follow are of the type, Z = X¥, where X 
and Z are known constants and Y ix to be found. Since Y is a value which 
is to be taken from the C-scale, the position of the decimal point must 
be determined. In accordance with the rule given in this article, 
k =k +C.; or C. =k -k_. But, C. equals: either C, or C, + 1 

z x y y z x y Y Y 
according as Status I or Status II exists, respectively. So, when Status 
I exists C, = k. - ki; and when Status II exists, C, = k. ~ k = l, 


Example 7. Find x in 15 = 3% 
Solution: Set the hairline to 3 on LL3. Draw the left C-index under the 
hairline. Move the hairline to 15 on LL3. Since Status I exists, C. a 
3 - 3 = 0. Thus, the answer, 2.46, is on C under the hairline. 


Example 8. Find x in % = 1.72. 
Solution: Set the hairline to 9 on LL3. Draw the left C-index under the 
hairline. Move the hairline to 1.71 on LL2. Since Status I exists, 


C. = 2-3 = -l. Thus, the ansser, 0.247, is on C under the hairline. 


Example 9. Find x in 2.03% = 17. 
Solution: Set the hairline to 2.03 on LL2. Draw the right (C- index under 
the hairline. Move the hairline to 17 on LL3. Since Status II exists, 
C_*#3-2-l1s 0. Thus, the answer, 4, is on C under the hairline. 


University of Tennessee E. L. Eagle 
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CURRENT PAPERS AND BOOKS 


Edited by 
H. V. Craig 


This department will present comments on papers previously published in the 
MATHEMATICS MAGAZINE, lists of new books, and book reviews. 

In order that errors may be corrected, results extended, and interesting 
aspects further illuminated, comments on published papers in all departments 
are invited. 


Communications intended for this department should be sent in duplicate to 
H. V. Craig, Dept. of Applied Mathematics, Univ. of Texas, Austin 12, Texas. 


The Theory of Algebraic Numbers. By Harry Pollard, Carus Mathematical 
Monograph, No. 9. Math. Assoc. of America, 1950, 142 pages. 


This monograph gives a concise account of the elementary parts 
of nineteenth century algebraic number theory, culminating in applica- 
tions to Fermat’s last theorem and a proof of the Dirichlet-Minkowski 
unit theorem. The expositionis clear, and the proofs elegantly conden ed. 

The hypothetical reader for whom the book is intended is not only 
assumed to be ignorant of German (otherwise he would read Hecke and 
Halbert) but also of the most elementary facts of ring theory as set 
forth for example in the first three chapters of McCoy’ s Carus Monograph 
Rings and Ideals. On the other hand, a certain mathematical maturity 
is required to follow the details of the proofs. Perhaps a senior or 
first-year graduate student, docile and intelligent, with an excellent 
grasp of calculus, but only a rudimentary acquaintance with modern 
algebra and modern languages, best meets these requirements. 

For other readers, there are the great German classics, or the 
admirable Princeton Monograph of Weyl on Algebraic Numbers. This mono- 
graph is certainly more difficult than Pollard’s. In compensation, the 
reader who masters it will understand not only Dedekind’s ideas, but 
also the root ideas of Kronecker, Hénsel and Hilbert. It is these ideas 
which have led to the development of modern valuation theory and class 
field theory, the main research fields for twentieth century work in 
algebraic numbers. 


Morgan Ward 


The Arithmetic Theory of Quadratic Forms. By Burton W. Jones, Carus 
Mathematical Monograph, No. 10, Math. Assoc. of America, 1950, 212 
pages. 

This excellent monograph gives a complete and self-contained account 
of the central ideas of the arithmetic theory of quadratic forms. Only 
the fundamentals of matrix algebra and elementary number theory are 
presupposed, but the reader is led by easy stages to the very frontiers 
of present-day knowledge. 

The arithmetic theory of quadratic forms has been completely trans- 
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formed in the last thirty years by the researches of Hasse, Siegel, 
Pall and Jones, himself. In fact many of the methods of proof and results 
are the author’s own, and are presented here for the first time. Certain- 
ly the book should do much to encourage further research in this 
fascinating field. 

The book is beautifully planned and written, distinguished not only 
by what is included, but by what has been left out. It is certainly 
one of the best of the Carus Monographs published to date, and no one 
at all interested in the Theory of Numbers can afford to miss it. 


Morgan Ward 


Editor’s Note 


The editors of the Mathematics Magazine like either to accept sub- 
mitted articles or give good and sufficient reasons for not accepting 
them. 

However papers on classic problems that have been proved unsolvable, 
such as trisecting all angles and squaring the circle, have become so 
numerous that it is not feasible for us to unravel them. Hence we have 
adopted the policy that authors will be required to show us some error 
in the classic proofs that such problems are unsolvable before we will 
weigh their attempts to solve them. 











PROBLEMS AND QUESTIONS 


Edited by 


C. W. Trigg, Los Angeles City College 


Readers of this department are invited to submit for solution problems 
believed to be new and subject-matter questions that may arise in study, in 
study, in research, or in extra-academic situations. Proposals should be 
accompanied by solutions, when available, and by such information as will 


assist the editor. Crdinarily, problems in well-known textbooks should not 
be submitted. 


Solutions should be submitted on separate, signed sheets. Figures should 
be drawn in India ink and twice the size desired for reproduction. Readers 
are invited to offer heuristic discussions in addition to formal solutions. 

Send all communications for this department to C. W. Trigg, Los Angeles 
City College, 855 N. Vermont Ave., Los Angeles 29, California. 


PROPOSALS 


112. Proposed by Victor Thebault, Tennie, Sarthe, France. 


Find a number of the form aaabbbccc which gives, when increased 
by unity, a perfect square of nine digits. 


113. Proposed by Benjamin Greenberg, Brooklyn, N. Y. 


Isosceles triangles with base angles of 30° are constructed ex- 
ternally on the sides of triangle ABC. The third vertices of the 
isosceles triangles determine an equilateral triangle. Can this be 
proven by pure synthetic geometry without recourse to trigonometry? 


114. Proposed by Dewey Duncan, East Los Angeles Junior College. 


a) In air a gold surfaced sphere weighs 7588 grams. It is known 
that it may contain one or more of the metals aluminum, copper, silver 
or lead. When weighed successively under standard conditions in water, 
benzene, alcohol, and glycerine its respective weights are 6588, 64688, 
6778, and 6328 grams. How much, if any, of the forenamed metals does 
it contain, if the specific gravities of the designated substances are 
taken to be as follows? 


Aluminum 2.7 Alcohol 0.81 
Copper 8.9 Benzene 0.90 
Gold 19,3 Glycerine 1.26 
Lead 11.3 Water 1.00 
Silver 10.5 


b) If the ball is given a very thin coating of plastic, to prevent 
amalgamating, how far will it sink when floating in mercury, specific 
gravity 13.6? 
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115. Proposed by Leo Moser, Texas Technological College. 


If all the faces of a polyhedron have central symmetry, prove that 


at least 6 of the faces are parallelograms. (A parallelopiped has exactly 
6 such faces. ) 


116. Proposed by H. H. Berry, University of Kentucky. 


If a, 6, and c are integers such that a + b = c and the least common 
multiple of a and 6 is M, where ab/(a,b) = M, then (c,M) = (a,b). 
[The symbol (a, b) represents the greatest common divisor of a and },| 


117. Proposed by H. D. Grossman, New York, N.Y. 


If n is a positive integer, prove that: a) The maximum difference 
between any two of the four suns, 4 + 7] + (3 # cee. 7] + (3) + 
0} la le Is 


(3| $otee ee, 3] + 7] + (T +e As gin/2] where [n/2] means the 


greatest integer <n/2. a 
8) The maximum difference between any two of the six sums, {| + Gh 


n + ee n 6 7| ce (z3] * ees. bats (3) ak + io + 7 is 
12 ak | 7 13 5 11 17 
gg" -"* 12 os ie even, or gim-')/2 sf n is odd. 


118. Proposed by A. M. McKinney, Dallas, Texas. 


Can the following integrals be evaluated? a) Je*sec x dx; 
b) fe'*sec x dx, without using the relation e'* = cos x + i sin x; 
c) fe-**sec x dx, without using the relation e"'* = cos x - 1 sin *%. 


SOLUTIONS 


Late Solution 


85. Erich Michalup, Caracas, Venezuela. 


Parallel Tangents to a Quartic 


89. [Jan. 1951] Proposed by H. T, R. Aude, Colgate University. 


If the graph of the quartic y = f(x) = x* + px* + qx + s has points 
of inflection, then there exist uniquely three pairs of parallel lines 
which are tangent to the quartic. Find the equations of the six lines, 
and note that the sum of their three slopes is 3. 


Solution by Alan Wayne, Flushing, N.Y. Clearly “points of in- 
flection” refers to “real points of inflection” which exist if and 
only if p is negative. Also, we note that it is desired to show that 
there exist uniquely three sets of precisely two parallel lines tangent 
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to the quartic. 

This problem has been solved, at least implicitly, by J. S. Frame 
(Tangent triangles to a biquadratic curve, American Mathematical Monthly, 
51, 445-450, (1944)]. We make use of Frame’s results and apply a simple 
calculation. By means of the “vertical affine” transformation 


1 
X= tied e, Y = (36/p*)(y - qx -s) +9, (1) 


the given quartic is reduced to the “normal form” 
Y = F(X) = X* ~ 6X2 + 9, (2) 


with preservation of collinearity of points, concurrency or parallelism 
of lines, mdpoints, centroids, tangents and contact points, and ratios 
of distances along parallel] lines. 

Let T,, T, and T, denote respectively the bitangent, the left in- 
flectional tangent, and the right inflectional tangent of the: normal 
quartic defined by (2), and let U., (i = 1, 2, 3), be the tangent to the 
quartic which is parallel to T;. Then (7;, U;), as Frame shows, are 
the only three sets in which there afe two (instead of three) parallel 
tangents. These lines have the following equations: 


T, Y= 0, U.: Y=9 
T,: Y= 8X +12. U,: Y«8xX-15 
T,: Y= -8X+ 12. Uy: Ye 8X -15 


Denote the images: of these lines, respectively, by t, and u;, under 
the transformation (1) which takes the quartic from normal form back 
into the “general” form. The desired equations are then 


my =q, =mM,=q + 8(-p/6)°/?, m, = q- R(-p/6)*’ , 


b, = s - (p*/4), b= by= s + (p*/12), 


c, = 8s and C= C45 8 - (2p?/3). 


Thus m, + m, + m, = 3q. 
Also solved by the proposer. 


For other properties of the graph of the quartic polynomial see 
American Mathematical Monthly, 37, 510, (1930); 56, 106, 165-170, 
(1949); National Mathematics Magazine, 15, 429, (May 1941); and This 
MAGAZINE, 22, 73-76, (Nov.-Dec. 1948). 


A Square Covered by Three Unit Squares 


93. (March 1951] Proposed by Leo Moser, Texas Technological College. 
What is the size of the largest square that can be completely covered 
by three unit squares? 
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Figure i 


Solution by Michael Goldberg, Washington, D.C. Place one small square 
in a corner of the large square. Place the other two small squares 
so that a vertex of each is hinged at the opposite vertex of the large 
square, overlapping each other as shown in Figure 1. These small squares 
should lie so that their edges pass through the other vertices of the 
large square and the nearby vertices of the smal] square in the corner. 

Let the edge of the large square be represented by e. Then the 
lengths of other line segments are functions of e as represented in 
Figure 1. From the similarity of the two right triangles, we have 


e/1 = (e - 1)/(1 - J e2 » k). 


Hence 





ef eet -120 or e=4(1 V¥S)/2 & 1.272. 


Also solved by Leon Bankhoff, Los Angeles, California; and T. C. 
Wilderman, Baltimore, Md. 

Bankhoff also offered a solution for the case where overlapping is 
not permitted. He placed two unit squares side by side and rested 
the third square symmetrically on the other two as in Figure 2. He 
then found the diagonal of the largest square that can be inscribed 
within the three-square configuration. Assuming that the sides of the 
desired square pass through A and B, the locus of one end (P) of the 
diagonal will be the semicircle AGB (radius = 1/2) on AB. The locus 
of the other end (Q) of the diagonal will be the line CE. Since the 
diagonal bisects angle APB it passes through F the extremity of the 
diameter FG perpendicular to AB. Let x be the length of the diagonal 
and represent angle PFG by @. Then 


x = PF + FG = cos @ + (sec 8)/2. 
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Setting dx/dé@ = 0, we have sec 6 tan 6 — 2sin 6 = 0, from which 


cos 6 = 1/72, 6 = 45°, x = /2 (Minimum diagonal) 


sin 6 = 0, 6=0 x = 3/2 


(Maximum diagonal). 


Consequently the side of the maximum square which can be completely 
covered by three non-overlapping squares is 3 2/42 1.061. 
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Wilderman also offered a synmetrical configuration in which vertices 
of the three overlapping squares coincided and the diagonals made 
angles of 120° with each other as in Figure 3. This arrangement contains 
two relatively maximum squares. The diagonal of one, ABCD, falls 
along a diagonal of one of the unit squares. It has a side AB=AE sec 30° 
= 213 = 1.155 as derived from triangle AEB. The computation of the 
side of the other square, FGNJ, which has sides parallel to one of 
the unit square’s diagonals requires more labor. HK = FJ = s, AH = 
(s/2) tan 15°, and KM = KH = s/2. From triangle AOE, AO = sin45°/sin60°, 
OM = OE = sin 75°/sin60°. Now AH + HK + KM = AO + OM, so (s/2)(tanl5° + 3) 
= (sin 45° + sin 75°)/sin 60°. Hence s = (3 ¥6 + 4. 72)/11 = 1.182. 


Curve Intersections with Integer Coordinates 


96. [March 1951] Proposed by R. E. Horton, Los Angeles City College. 

Determine conditions on the integers p and a such that both coor- 
dinates of the points of intersection of x* + y* = a® and y* = 2px will 
be integers. 


Solution by Leon Bankoff, Los Angeles, Calif. It is well-known 
that x = 2mm, y=m - n*, a=, +n* and x = a - n*, y = 2mn, 
a =m + n* (with m, n relatively prime and of different parity) give 


primitive solutions of x* + y* = a*. Then p = y*/2x = (m* - n*)*/4mn or 
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4m*n?/2(m? - n*). But since p is: an integer, we have as general solu- 
tions: 


x= 8km?n?, y = 4kmn(m? - n*), a = 4kmn(m + n?), p = k(m® - n*)? 
and 


x = k(m? —n?)?, y = Qkmn(m? - n*), a = k(m* -7n*), p = 2km*n*, 


. 
, 


where k is an integer. Obviously, for every solution (a,p) there will 
be three other solutions (a, -p), (-a, p), and (-a, -p). 


Also solved by L. A. Ringenberg, Eastern Illinois State College; 
and the proposer. 


To Construct a Right Triangle, given r and R 
99. [May 1951] Proposed by T. E. Sydnor, Pasadena City College, Calif. 


a) Construct a right triangle given the inradius and the circum- 
radius. b) What conditions must be placed on r and R in order that the 
triangle shall have integer sides? 














D 


Figure 1 Figure 2 


Solution by L. A. Ringenberg, Eastern Illinois State College. a) 
Construct an angle of magnitude 7/4 and call its vertex D, as in Figure 
1. On one side lay off DA = 2R + 2r. With center A and radius 2R describe 
a circle cutting the other side of the angle in B and B’. From B and 
B' drop perpendiculars BC and B’C’ to DA. Then BCA (or B’C'A) is the 
required triangle. 

Obviously, circumradius of the triangle is R. It remains to show 
that its inradius is r. Let AB = c, BC = a, and CA = b. Since DC’ = C'B' 
and DC = CB, it follows that a + b = 2r + 2R, whence a* + b* + 2ab 
= 4r(r + 2R) + 4R*. Now a* + b? = c? = 4R*, so the area K = ab/2 = 
r(r +2R). But the area is also equal to the semiperimeter times the 
inradius. Therefore, since the semiperimeter is r + 2R, the inradius 
is r. 

It is clear from the construction that there will be two, one, or 
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> 
no solutions according as R 2 (1 + /2)r. The congruency of triangles 


BCA and B'C'A is easily established. 
We further observe that the sides a, 6, c may be expressed in terms 


of r and R. c = 2R, and a* = c* — b* = 4R* ~- (2r + 2R - a)?. It follows 
that a(or 6) = (R +r) ¢ | ~ 2Rr - r*. 


b) The last expression implies certain restrictions on r and R in 
order that the sides may be integers. However, it is more expeditious 
to employ the well-known parametric solution a = 2wuv, 6 = wu? - v*), 
c = w(u* + v*), whereupon R = c/2 = w(u® + v?)/2 and r = (a + b - c)/2= 
wo(u- v). Here u, v, w are integers with u > v and u and v relatively’ 
prime and not both even. 


Also solved by Leon Bankoff, Los Angeles, California and the proposer, 
both of whom used the construction method of Figure 2. There, two 
perpendicular diameters BC and ED were drawn in a circle of radius 
R. A second circle was described with center D and radius DB, Parallel 
to BC and at a distance r from DB a line XY cuts the second circle 
in P and P’ the incenters. Tangents from B and C to the incircle complete 
the construction. 





An Illegitimate Operation 


103. [May 1951] Proposed by M. S. Klamkin, Polytechnic Institute 
Brooklyn, N.Y. 


A high-school student solved the linear differential equation 
dy/dx + Py = Q for y as if it were an ordinary algebraic equation. 
Under what conditions could this procedure have yielded a correct 
solution of the differential equation? 


I. Solution by P. B. Jordain, Columbia University, New York. 
Solving 


dy/dx + Py = Q (1) 


as an ordinary algebraic equation, the high-school student obviously 
cancelled the d’s and got 


y = Qx/(1 + Px) (2) 


with the condition that P # -1/x. 
If (2) is to be a solution of (1), it must satisfy (1) identically. 
From (2) we obtain 


y’ = (Q + Q'x + PQ'x? - P’Qx?)/(1 + Px)’. (3) 
Substituting the values (2) and (3) in (1) and simplifying we get 


Q'/Q = (P+ P'x)/(1 + Px). 
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Thus 
log Q= log k(1 + Px). 


It follows that y = Q/(1 + Px) will be a solution of (1) if P and Q 
satisfy the general relation Q = k(1 + Px). 


II. Solution by R, E, Winger, Los Angeles City College. The 
high-school student presumably cancelled the d’s, getting y/x + Px = Q, 
If this is to be a correct solution of the differential equation, then 
y/x = dy/dx. That is, y * cx, an almost trivial result. When this is 
substituted into the original equation (in either form) we get Q = c(1 + 
Px), the necessary relation between P and Q. 


Also solved by L. A. Ringenberg, Eastern Illinois State College; 
and the proposer. 


QUICKIES 


From time to time this department will publish problems which may be 
solved by laborious methods, but which with the proper insight may be dis- 
posed of with dispatch. Readers are urged to submit their favorite problems 
of this type, together with the elegant solution and the source, if known. 


Q 48. Mentally multiply 38 by 32. 


Q 49. n great circles on a sphere will, in general, intersect in 
n(n - 1) points. Show how to place the numbers 1, 2, ***, n(n - 1) on 
these points in such a way that the sum of the numbers on every great 
circle is the same. [Submitted by Leo Moser.] 


Q 50. Show that it is impossible to find integer solutions of x* + 


y4 + 24 - xdy? - y422 - 24x? + x2y2z2 = 0. [Emanuel Lasker in Scripta 
Mathematica, 8, 14, (Marxh 1941).] 


ANSWERS 
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aie gousy 103985 uOUMOD OU aAeYy z ‘A ‘x YeYI powNsse oq ABM Izy “OE V 

"(1 - 4)[T + (T = ¥)u] 09 [e909 TTT eToITO yoee uo sraqunu ayy 
‘squtod aytsoddo AT[eots}ouRTp FO sated [| - u SUTeJUOD BTIITO YORE BOUTS 
‘paysneyxe are (squtod ey [Te pue) srequnu 943 [Te [TIuN Aem styQ ut 
anut3 U0) *qutod aytsoddo AT[eOtAQoWeTp 24 UO ‘p-T+(I -u)u = q 
‘ quoweTdwos |, sjt aoeTd pue qutod Aue uo pv Jequnu Aue 2°987q ‘6b V 
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THE PERSONAL SIDE OF MATHEMATICS 


This department desires especially articles showing what mathematics means 
to people in various professions and historical articles showing what classic 
mathematics meant to those who developed it. Material intended for this 
Department should be sent to the Mathematics Magazine, 14V68 Van Nuys Blvd., 
Pacoima, California. 


WHAT MATHEMATICS MEANS TO ME 


Lewis Bayard Robinson 


The author when a collegian at Johns Hopkins was haunted by that 
question: 

What is truth? 

And he knew that the motto of his college was: 

“Veritas vos liberabit.” 

Therefore he resolved to become a mathematician because of ail the 
sciences mathematics is the most rigorous and can best tell us what is 
truth, 

The ancient Egyptians developed arithmetic and geometry for practical 
purposes. But this deeply religious people, as far as we know, did not 
see the mystical aspect of mathematics. The Babylonians, going a step 
further, were the best astronomers of antiquity coming very close to 
jiscovering the law of gravitation. But from the time at least of 
Pythagoras, if not earlier, the Greeks saw the intimate connection of 
mathematics and metaphysics. And ever since the sixth century B.C., 
a century of great and rapid change, the metaphysical aspect of math- 
ematics has never been completely lost to sight. 

But all men are not mystics. The military engineer, important in the 
age of the two horned Alexander is, alas, important today. The practical 
sides of mathematics, applications to physics and other utilitarian 
sciences have tended to crowd out the metaphysical aspects. 

The author is studying that profound work of Bergson: 

‘Durée et Simultaneite a propos de la Theorie d’Einstein”. 

It makes him feel that in the future mathematics will more and more 
enter the domain of metaphysics until it remakes that science as it has 
remade’ physics. Time, said Bergson, is non-homogeneous .” If unable to 
go to sleep at night the author likes to meditate on the possibility 
of constructing a mathematical theory where time and space are non- 
homogeneous. Now space-time is a multitude of simultaneous events, 
in number infinite, and each flowing with a different velocity. Perhaps 
the study of functions depending on variables, infinite in number, 
will set us on the road to discovery. 


Tarn—The Greeks in Bactria and India, p. 44. 
Time and Free Will. 


Baltimore, Maryland 
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